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Introduction 



The Les Houches Workshop on Physics at TeV Colhders took place from June 8-18, 
1999. One of the three working groups at Les Houches concentrated on QCD issues, both 
at the Tevatron Collider and at the LHC. Besides the interest in QCD in its own right, 
QCD dynamics plays an important role in the production mechanisms for any new physics 
processes that might be observed at either collider, as well as any processes that may form 
backgrounds to the new physics. As might be expected, there was a great deal of overlap 
with the other two working groups, and especially with the Higgs working group. 

To provide a more specific focus, each day at Les Houches was devoted to a specific topic. 
The topic and speakers are listed below: 

• Thursday: parton distibutions (W.K. Tung, R. Ball) 

• Friday: photons (E. Pilon, M. Fontannaz, S. Prixione); jet definitions (J. Huston, D. 
Zeppenfeld) 

• Saturday: Monte Carlos (P. Paige, J.Collins, K. Odagiri, Y. Kato, E. Boos, A. Skatchkova, 
V. Ilyin) 

• Monday: joint meeting with Higgs group (C. Balazs) 

• Tuesday: (re)summation (C. Balazs) 

• Wednesday: direct photons and pions (P. Aurenche), heavy flavor (S. Prixione) 

• Thursday: N"LO (V. del Duca, G. Henrich) 

This writeup for the QCD working group is not intended to be a comprehensive summary 
of all of the QCD issues currently important in high energy physics, or expected to be impor- 
tant for the LHC. Rather, we have chosen to concentrate in detail on a few selected topics 
and to summarize in a pedagogical manner the current status and the progress expected in 
the near future. The expertise of the people attending the workshop, and the timeliness of 
the issues, resulted in a great deal of concentration on resummation calculations, and their 
relation to Monte Carlos. This is also reflected in the writeup. The writeup is organized into 
chapters that roughly follow the organization shown above. Preceding the discussion of the 
individual topics is a general introduction to QCD phenomenology relevant for the LHC, by 
S. Catani. 

Joey Huston (for the Working Group). 

Acknowledgements: 

We would like to thank the organizers of the workshop for the pleasant and stimulating 
atmosphere. Les Houches is a beautiful place to do physics; we should do this every two 
years. 
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Aspects of QCD, from the Tevatron to the LHC 



S. Catani 



Abstract 

This contribution presents a selection of the topics (parton densities, fixed- 
order calculations, parton showers, soft-gluon resummation) discussed in my 
introductory lectures at the Workshop and includes a pedagogical overview of 
the corresponding theoretical tools. 

1 Introduction 

The production cross sections for all the processes at hadron-collider experiments are con- 
trolled by strong interaction physics and, hence, by its underlying field theory, QCD (see 
recent overviews in Refs. [0, 0, 0, |]). Studies of QCD at the Tevatron and the LHC have 
two main purposes P, First, they are important to test the predictions of QCD, to 

measure its fundamental parameters (e.g. the strong coupling as) and to extract quantita- 
tive information on its non-perturbative dynamics (e.g. the distribution of partons in the 
proton). Second, they are relevant to a precise estimate of the background to other Standard 
Model processes and to signals of new physics. 

This contribution is not a comprehensive review of QCD at high-energy hadron colliders. 
It is based on a selection of the topics presented in my introductory lectures at this Workshop. 
The selection highlights the QCD subjects that were most discussed during the Workshop 
and includes a pedagogical overview of some of the corresponding theoretical tools. 

After the introduction of the general theoretical framework, I summarize in Sect. |^ the 
present knowledge on the parton densities and its impact on QCD predictions for hard- 
scattering processes at the Tevatron and the LHC. In Sect. H, I then discuss some issues 
related to processes that are sensitive to the gluon density and, hence, to its determination. 
Section ^ presents a dictionary of different approaches (fixed-order expansions, resummed 
calculations, parton showers) to perturbative QCD calculations. The dictionary continues in 
Sect. ^, where I review soft-gluon resummation and discuss some recent phenomenological 
applications of threshold resummation to hadron collisions. 

The QCD framework to describe any inclusive hard-scattering process, 

hM + h,{p^)^H{Q,{...})+X , (1) 
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in hadron-hadron collisions is based on perturbation theory and on the factorization theorem 
of mass singularities. The corresponding cross section is computed by using the factorization 
formula 



C^(Pl,P2;<5, {•••}) = J2 dXidX2fa/hiiXi,IJ,p)fb/h2{x2,I^F)aabiXiPi,X2P2;Q,{---}if4^) 



The colliding hadrons hi and /12 have momenta pi and p2, H denotes the triggered 
hard probe (vector bosons, jets, heavy quarks, Higgs bosons, SUSY particles and so on) 
and X stands for any unobserved particle produced by the collision. The typical scale 
Q of the scattering process is set by the invariant mass or the transverse momentum of 
the hard probe, and the notation {. . .} stands for any other relevant scale and kinematic 
variable of the process. For instance, in the case of W production we have Q = My/ and 
{...} = {Q±, y, . . .}, where Mw, Q± and y are the mass of the vector boson, its transverse 
momentum and its rapidity, respectively. 

The factorization formula (§) involves the convolution of the partonic cross sections aab 
(where a,b = q, q, g) and the parton distributions fa/h{x, of the colliding hadrons. If the 
hard probe if is a hadron or a photon, the factorization formula has to include an additional 
convolution with the corresponding parton fragmentation function da/niz, f^''^)- 

The term O {{Aqcd/Q)^) on the right-hand side of Eq. (0) generically denotes non- 
perturbative contributions (hadronization effects, multiparton interactions, contributions of 
the soft underlying event, and so on). Provided the hard-scattering process (|l]) is sufficiently 
inclusive^, dab is computable as a power series expansion in as(Q^) and the non-perturbative 
contributions are (small) power-suppressed corrections (i.e. the power p is positive) as long 
as the hard-scattering scale Q is larger than few hundred MeV, the typical size of the QCD 
scale Aqcd- 

The parton densities fa/h{x, fJ^'p) are phenomenological distributions that describe how 
partons are bounded in the colliding hadrons. Although they are not calculable in QCD per- 
turbation theory, the parton densities are universal (process-independent) quantities. The 
scale fip is a factorization scale introduced in Eq. (^ to separate the bound-state effects from 
the perturbative interactions of the partons. The physical cross section cr{pi,P2] Q, {■ ■ ■}) 
does not depend on this arbitrary scale, but parton densities and partonic cross sections sepa- 
rately depend on ^p. In particular, higher-order contributions to aab{xipi, X2P2', Q, {■ ■ ■}', 1^^^) 
contain corrections of relative order (as(Q^) InQ^/ /i|')"'. If /iF is very different from Q, these 
corrections become large and spoil the reliability of the perturbative expansion. Thus, in 
practical applications of the factorization formula (|^) , the scale /li? is set approximately equal 
to the hard scale Q and variations oi hf around this central value are used to estimate the 
uncertainty of the perturbative expansion. 

The lower limit Xmin of the integrations over the parton momentum fractions Xi and X2, 
as well as the values of Xi and X2 that dominate the convolution integral in Eq. (Q), are 
controlled by the kinematics of the hard-scattering process. Typically we have Xmin ^Q^/S, 

^ More precisely, it has to be defined in an infrared- and coUinear-safe manner. 




+ o{{AQCD/Qr) 
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LHC parton kinematics 




Figure 1: The (x, Q^) plane of the parton kinematics for the production of a heavy system of 
invariant mass M and rapidity y at LHC, HERA and fixed-target experiments. 

where S* = (pi + ^2)^ is the square of the centre-of-mass energy of the colhsion. If the hard 
probe is a state of invariant mass M and rapidity the dominant values of the momentum 
fractions are Xi^2 ~ (Me^^)/v^ (see Fig. |l]). Thus varying M and y at fixed \fS, we are 
sensitive to partons with different momentum fractions. Increasing the parton densities 
are probed in a kinematic range that extends towards larger values of Q and smaller values 

of Xi,2- 



2 Parton densities 

The parton densities are an essential ingredient to study hard-scattering collisions. Once the 
partonic cross sections have been perturbatively computed, cross section measurements can 
be used to determine the parton densities. Then, they can in turn be used to predict cross 
sections for other hard-scattering processes. 

The dependence of the parton densities^ /a(a;, /i^) on the momentum fraction x and their 
absolute value at any fixed scale /i are not computable in perturbation theory. However, the 
scale dependence is perturbatively controlled by the DGLAP evolution equation 

dUx.^^P =J2['-Pabiasif.%z)Ux/z,f,') . (3) 
d m ^ Jx z 

^In the following the parton densities of the proton fa/p are simply denoted by fa and those of the 
antiproton are obtained by using charge-conjugation invariance, i.e. fa/p — fa/p ^ fa- 
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Overview of Parton Distribution Functions of the Proton 




Figure 2: Typical x-shape of the parton densities (set CTEQ5M at Q = 5 GeV). 



The kernels Pafc(as, z) are the AltareUi-Parisi (AP) sphtting functions. As the partonic cross 
sections in Eq. the AP sphtting functions can be computed as a power series expansion 
in as: 

P^as, z) = «sPr^(.) + alPr''\z) + aiPir"^)(.) + 0(4) . (4) 

The leading order (LO) and next-to-leading order (NLO) terms Pj^l^^\z) and pI^^^^\z) in 
the expansion are known [|l^. These first two terms are used in most of the QCD studies. 
Having determined faix,Ql) at a given input scale /i = Qq, the evolution equation (D can 
be used to compute the parton densities at different perturbative scales /i and larger values 
of X. 

The parton densities are determined by performing global fits to data from deep- 
inelastic scattering (DIS), Drell-Yan (DY), prompt-photon and jet production. The method 
consists in parametrizing the parton densities at some input scale Qq and then adjusting the 
parameters to fit the data. The parameters are usually constrained by imposing the positivity 
of the parton densities {fa{x,fx^) > 0) and the momentum sum rule {Y.a lo dx x fa{x, fi^) = 
I). 



The present knowledge on the parton densities of the proton is reviewed in Refs. M, 12 



13| . Their typical behaviour is shown in Fig. g. All densities decrease at large x. At small x 
the valence quark densities vanish and the gluon density dominates. The sea-quark densities 
also increase at small x because they are driven by the strong rise of the gluon density and 
the splitting of gluons in qq pairs. Note that the quark densities are not flavour-symmetric 
either in the valence sector (m„ 7^ dy) or in the sea sector {u ^ d). 

In addition to having the best estimate of the parton densities, it is important to quantify 
the corresponding uncertainty. This is a difficult issue. The uncertainty depends on the 
kinematic range in x and Q^. Moreover, it cannot be reliably estimated by simply comparing 
the parton densities obtained by different global fits. In fact, a lot of common systematic 
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and common assumptions affect the global-fit procedures. Recent attempts to obtain parton 
densities with error bands that take into account correlations between experimental errors 
are described in Refs. [0, |T^. Some important theoretical uncertainties that are still to be 
understood are also discussed in Ref. [0]. 

The overall conclusion is that the quark densities^ are reasonably well constrained and 
determined by DIS and DY processes, while the gluon density is certainly more uncertain 
111 , p!7| . At small x (x^lO~^), the gluon density fg is at present constrained by a single 



process, namely DIS at HERA. Thus, large higher-order corrections of the type (aslnl/x)" 
could possibly affect the extraction of fg. Assuming that fg is well determined at small x, 
the momentum sum rule reasonably constrains fg at intermediate values of x (x ~ 10~^). 
Jet production at the Tevatron at low to moderate values of the jet transverse energy Ej^ can 
also be useful in constraining the gluon distribution in the range 0.05^x^0.2. At large x 
{x'k, 10^^), the most sensitive process to fg is prompt-photon production. Since, at present, 
prompt-photon data are not well described/predicted by perturbative QCD calculations, 
they cannot be used for a precise determination of fg. Further discussion on these points is 
given in Sect. 0. 

The conclusion that the gluon density is not well known can also be drawn by inspection 
(see Fig. ^ of the differences between the most updated analyses performed by the CTEQ 
Collaboration and the MRST group. 

The differences between the MRST gluons and the CTEQ ones are due to the fact that 
the two groups used different data sets. The various gluon densities are very similar at small 
X, because in this region both groups used the HERA data. The MRST group includes 
prompt-photon data in the global fit: these data constrain the gluon directly at x^lO^^ 

•^Uncertainties on the determination of the quark densities at very high x are discussed in Refs. 0, |l^, 
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Gluon Variations That Are Consistent 




Q = 100 GeV 



lO'" 10'" 10"' 0.1 0.2 0.3 0.4 0.5 

Parton x 

Figure 4: A picture of the gluon density uncertainty. The continuous (black) lines refer to gluon 
densities that are constrained only by DIS and DY data. The dashed (coloured) lines refer to 
gluon densities of the MRST group, which uses also prompt-photon data. 

and indirectly (by the momentum sum rule) at x ~ 10^^. The CTEQ group does not use 
prompt-photon data, but it includes Tevatron data on the one-jet inclusive cross section. 
These data give a good constraint on fg in the region 0.05^x^0.2. 

There are also differences within the MRST and CTEQ sets. The various gluon densities 
of the MRST set correspond to different values of the non-perturbative transverse- momentum 
smearing that can be introduced to describe the differences among the prompt-photon data 
that are available at several centre-of-mass energies. The CTEQ5M and CTEQ5HJ gluons 
correspond to different assumptions on the parametrization of the functional form of fg{x, Qq) 
at large x; the CTEQ5M set corresponds to the minimum-x^ solution of the fit while the 
CTEQ5HJ set (with a slightly higher x^) provides the best fit to the high-ET tail of the 
CDF and DO jet cross sections. 

This brief illustration shows that the differences in the most recent parton densities are 
mainly due to either inconsistencies between data sets and/or poor theoretical understanding 
of them. A more quantitative picture of the dependence on x and of the gluon density 
uncertainty is presented in Fig. ^. 

We can see that the DIS and DY data sets weakly constrain fg for x ^ 10"^. Since the 
AP sphtting functions lead to negative scaling violation at large x, when fg{x, Q"^) is evolved 
at larger scales Q according to Eq. (H) the gluon uncertainty is diluted: it propagates at 
smaller values of x and its size is reduced at fixed x. 

Figure |^ shows the typical predictions for hard-scattering cross sections at the Tevatron 
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proton - (anti)proton cross sections 




Vs (TeV) 

Figure 5: QCD predictions for hard-scattering cross sections at the Tevatron and the LHC. 
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and the LHC, as obtained by using the parton densities of the MRST set. These predictions 
have to be supplemented with the corresponding uncertainties coming from the determi- 
nation of the parton densities and from perturbative corrections beyond the NLO. Owing to 
the increased centre-of-mass energy and to QCD scahng violation (see Fig. the kinematic 
region with small uncertainties is larger at the LHC than at the Tevatron. 

For most of the QCD processes at the LHC, the uncertainty from the parton densities 
is smaller than ±10% and, in particular, it is smaller than the uncertainty from higher- 
order corrections. Some relevant exceptions are the single-jet, W/Z and top quark cross 
sections. In the case of the single-jet inclusive cross section at high Et {Et^2 TeV), the 
uncertainty from the poorly known gluon density at high x is larger than that (~ ±10%) 
from higher-order corrections. The W and Z production cross sections are dominated by qq 
annihilation. Since the quark densities are well known, the ensuing uncertainty on the W/Z 
cross section is small (~ ±5%). Nonetheless, in this case the uncertainty from higher-order 
corrections is even smaller, since the partonic cross sections for the DY process are known 
p!8| at the next-to-next-to-leading order (NNLO) in perturbation theory. In the case of top- 



quark production at the LHC, the gluon channel dominates and leads to an uncertainty of 
±10% on the total cross section. Also for this process, however, the perturbative component 



is known beyond the NLO. Including all-order resummation of soft-gluon contributions [19 



the estimated uncertainty from unknown higher-order corrections is approximately ±5% ^9 
0- 



3 The gluon density issue 

At present, the processes'^ that are, in principle, most sensitive to the gluon density are 
DIS at HERA, 6-quark production at the Tevatron, and prompt-photon production at fixed- 
target experiments. These processes constrain fg for x^lO~^, x ~ 10~^-10~^ and x^lO~^, 
respectively. Nonetheless, the gluon density is, in practice, not well determined. The issue 
(or, perhaps, the puzzle) is that from a phenomenological viewpoint the standard theory, 
namely perturbative QCD at NLO, works pretty well for x ^ 10^^ but not so well at larger 
values of x, while from theoretical arguments we should expect just the opposite to happen. 
This issue is discussed below mainly in its perturbative aspects. We should however keep it 
in mind that all these processes are dominated by hard-scattering scales Q of the order of 
few GeV. Different types of non-perturbative contributions can thus be important. 

From the study of DIS at HERA we can extract information on the gluon and sea-quark 
densities of the proton. The main steps in the QCD analysis of the structure functions at 
small values of the Bjorken variable x are the following. The measurement of the proton 
structure function F2{x,Q'^) ~ qsi^^Q"^) directly determines the sea-quark density qs = 
x{fg + fg). Then, the DGLAP evolution equation (|^) or, more precisely, the following 
equations (the symbol ® denotes the convolution integral with respect to x): 

dF2{x,Q^)/dlnQ^ ~ Pgg^qs + Pgg^g , (5) 

''The role of jet production at the Tevatron has briefly been recafled in Sect. |[ and it is discussed in 
detail in Ref. 
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dg{x,Q^)/d\TiQ^ ~ Pgq®qs + Pgg^g , (6) 

are used to extract a gluon density g{x,Q'^) = xfgi^x^Q"^) that agrees with the measured 
scahng violation in dF2{x,Q'^) / dlnQ"^ (according to Eq. (^) and fulfils the self-consistency 
equation (|^). 

The perturbative-QCD ingredients in this analysis are the AP splitting functions Pab{asi 
Once they are known (and only then), the non-perturbative gluon density can be determined. 

The standard perturbative-QCD framework to extract g{x, Q"^) consists in using the 
truncation of the AP splitting functions at the NLO. This approach has been extensively 
compared with structure function data over the last few years and it gives a good description 
of the HERA data, down to low values of ~ 2 GeV^. The NLO QCD fits simply require 
a slightly steep input gluon density at these low momentum scales. Typically we have 



g{x,Ql) ~ x~^, with A ~ 0.2 at Ql ~ 2GeV^, and the data constrain g{x,Ql) with an 
uncertainty of approximately ±20%. 

Although it is phenomenologically successful, the NLO approach is not fully satisfactory 
from a theoretical viewpoint. The truncation of the splitting functions at a fixed perturbative 
order is equivalent to assuming that the dominant dynamical mechanism leading to scaling 
violations is the evolution of parton cascades with strongly-ordered transverse momenta. 
However, at high energy this evolution takes place over large rapidity intervals {Ay ~ In l/x) 
and diffusion in transverse momentum becomes relevant. Formally, this implies that higher- 
order corrections to Pab{c(s,x) are logarithmically enhanced: 

Pafe(«s,x)~^ + ^(aslnx) + ... + ^(aslnx)" + ... . (7) 

XX X 

At asymptotically small values of a;, resummation of these corrections is mandatory to obtain 
reliable predictions. 

Small-x resummation is, in general, accomplished by the BFKL equation |2^. In the 



context of structure-function calculations, the BFKL equation provides us with improved 
expressions of the AP splitting functions Pab{c(s,x), in which the leading logarithmic (LL) 
terms (aslnx)"", the next-to-leading logarithmic (NLL) terms as (^s In a;)"", and so forth, 
are systematically summed to all orders n in as. The present theoretical status of small-x 
resummation is discussed in Ref. |0. Since in the small-x region the gluon channel dominates, 
only the gluon splitting functions Pgg and Pgg contain LL contributions. These are known 
pO| , to be positive but numerically smaller than naively expected (the approach to the 
asymptotic regime is much delayed by cancellations of logarithmic corrections that occur at 
the first perturbative orders in Pgg and Pgq)- The NLL terms in the quark splitting functions 
Pqg and Pgq are known |2^ and turn out to be positive and large. A very important progress 
is the recent calculation ^ of the NLL terms in Pgg, which are found to be negative 
and large. The complete NLL terms in Pgq are still unknown. 



The results of Refs. pB| , P^ , the large size of the NLL terms and the alternating sign 
(from the LL to the NLL order and from the gluon to the quark channel) of the resummed 
small-x contributions have prompted a lot of activity (see the list of references in Ref. 0) on 
the conceptual basis and the phenomenological implications of small-x resummation. This 
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Figure 6: Comparison between the LO (GRV98LO) and NLO (GRV98NLM) GRV parametriza- 
tions of the gluon and sea-quark densities at = 5 GeV^. 

activity is still in progress and definite quantitative conclusions on the impact of small-x 
resummation at HERA cannot be drawn yet. 

At the same time, the capability of the fixed-order approach to produce a good description 
of the proton structure function F2{x,Q^) at HERA cannot be used to conclude that the 
small-x behaviour of the gluon density is certainly well determined. In fact, by comparing 
LO and NLO results, we could argue that the ensuing theoretical uncertainty on fg is sizeable 
Going from LO to NLO, we can obtain stable predictions for F2, but we have to vary 
the gluon density a lot. As shown in Fig. |^, the NLO gluon density sizeably differs from its 
LO parametrization, not only in absolute normalization but also in x-shape. For instance, 
at a; = 10"^ and = 5 GeV^ the NLO gluon is a factor of 2 smaller than the LO gluon. 
This can be understood [^] from the fact that the scaling violation of F2 is produced by the 
convolution Pgg g (see the right-hand side of Eq. (^). The quark splitting function Pgg 
behaves as 

CAas 1 



1 + 2.2^^- + 

vr X 



where the LO term P^g'^''{x) is flat at small x, whereas the NLO correction is steep. To 
obtain a stable evolution of F2, the NLO steepness of Pgg has to be compensated by a gluon 
density that is less steep at NLO than at LO. This has to be kept in mind when concluding on 
the importance of small-x resummation because the NLO steepness of Pgg is the lowest-order 
manifestation of BFKL dynamics in the quark channel. 

In the large- a; region, there is a well-known correlation between as and fg. At small x, 
there is an analogous strong correlation between the x-shapes of Pqg and fg. In the fixed- 
order QCD analysis of F2, large NLO perturbative corrections at small x can be balanced 
by the extreme flexibility of parton density parametrizations. It is difficult to disentangle 
this correlation between pro cess- dependent perturbative contributions and non-perturbative 
parton densities from the study of a single quantity, as in the case of F2 at HERA. The 
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- NLO QCD, MRSR2 

Theoretical Uncertainty 



6 7 8 9 10 



Pt"" [GeV/c] 



Figure 7: Comparison between Tevatron data and NLO QCD for 6-quark production The 
band is obtained by varying factorization and renormalization scales in the NLO calculation. 



uncertainty on the gluon density at small x, as estimated from the NLO QCD fits of the 
HERA data, is evidently only a lower limit on the actual uncertainty on fg. 

The production of b quarks at the Tevatron is also sensitive to the gluon density at 
relatively small values of x. The comparison between Tevatron data and perturbative-QCD 
predictions at NLO |^ is shown in Fig. ^ Using standard sets of parton densities, the 
theoretical predictions typically underestimate the measured cross section by a factor of 2. 
This certainly is disappointing, although justifiable by the large theoretical uncertainty of 
the perturbative calculation [^. A lower limit on this uncertainty can be estimated by 
studying the scale dependence and the convergence of the perturbative expansion. Varying 
the factorization and renormalization scales by a factor of four around the 6-quark mass rrih, 
the NLO cross section varies by a factor of almost 2 at the Tevatron and by a factor of 4-5 
at the LHC 0. Similar factors are obtained by considering the ratio of the NLO and LO 
cross sections. 

The present theoretical predictions for 6-quark production at hadron colliders certainly 
need to be improved 0. Since the hard scale Q ~ is not very large, a possible im- 
provement regards estimates of non-perturbative contributions (for instance, effects of the 
fragmentation of the 6-quark and of the intrinsic transverse momentum of the colliding par- 
tons). As for the evaluation of perturbative contributions at higher orders, the resummation 
of logarithmic terms of the type ag ln"(j9(/mb) is important |29] when the transverse mo- 
mentum pt of the b quark is much larger than rrify. The resummation of small-x logarithmic 
contributions agln"x can also be relevant, because x ~ 2miy/VS is as small as ~ 10~^ at 
the Tevatron and as ~ 10~^ at the LHC. The theoretical tool to perform this resummation, 
namely the /c^-factorization approach is available. Updated phenomenological studies 



based on this tool and on the information from small-x DIS at HERA would be interesting. 
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Figure 8: The dependence on the factorization (fxp) and renormalization (/i^ 
the LO and NLO prompt-photon cross section da/dEx in pN collisions at Et 
^/S = 31.6 GeV. The resummed calculation is discussed in Sect. |5|. 
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Prompt-photon production at fixed-target experiments is sensitive to the behaviour of the 
gluon density at large x (x^O.l). The theoretical predictions for this process, however, are 
not very accurate. Figure ^ shows the factorization- and renormalization-scale dependence 
of the perturbative cross section for the case of the E706 kinematics. If the scale is varied by 
a factor of 4 around the transverse energy E^ of the prompt photon, the LO cross section 
varies by a factor of almost 4. Going to NLO |^ the situation improves, but not very much, 
because the NLO cross section still varies by a factor of about 2. 

A detailed comparison between NLO QCD calculations and data from the ISR and fixed- 
target experiments has recently been performed in Ref. |32]. As shown in Fig. |^, the overall 



agreement with the theory is not satisfactory, even taking into account the uncertainty com- 
ing from scale variations in the theoretical predictions. Modifications of the gluon density can 
improve the agreement with some data sets only at the expense of having larger disagreement 
with other data sets. The differences between experiments at similar centre-of-mass energies 
(see, for instance, E706 pBe/530 at = 31.6 GeV and WA70 pp at = 23 GeV) are 
much larger than expected from perturbative scaling violations. This can possibly suggest 
1 3^ inconsistencies of experimental origin. 

Another (not necessarily alternative) origin of the differences between data and theory 
could be the presence of non-perturbative effects that are not included in the NLO pertur- 
bative calculation. This explanation has been put forward in Refs. [^, 34 1 by introducing 
some amount of intrinsic]^ transverse momentum {k±) of the colliding partons. Owing to 

^To be precise, in Ref. Isl] the (fcj^) of the colliding partons is not called 'intrinsic', but it is more 
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the steeply falling Et distribution {da/dEx oc l/Ej) of the prompt photon, even a small 
transverse-momentum kickQ can indeed produce a large effect on the cross section, in partic- 
ular, at small values of Et- Phenomenological investigations |3^| show that this additional 
{k±) kick can lead to a better agreement between calculations and data. The E706 data 
suggest the value (fcj,) ~ 1.2 GeV, the WA70 data prefer no and the UA6 data in the 
intermediate range of centre-of-mass energy (v^ = 24.3 GeV) may prefer an intermediate 
value of (fcj.). Similar conclusions are obtained in the analysis by the MRST group [|Tl[] . 

A precise physical understanding of (fc^) effects is still missing. On one side, since the 
amount of suggested by prompt-photon data varies with \fS, it is difficult to argue that 
the transverse momentum is really 'intrinsic' and has an entirely non-perturbative origin. On 
the other side, in the case of the inclusive production of a single photon, a similar effect can- 
not be justified by higher-order logarithmic corrections produced by perturbative soft-gluon 
radiation (see Sect. |^). A lot of model-dependent assumptions (and ensuing uncertainties) 
certainly enter in the present implementations of the {k±) kick. A general framework to con- 
sistently include non-perturbative transverse-momentum effects in perturbative calculations 
is not yet available. Recent proposals with this aim are presented in Refs. [^] and [^. 

Further studies on the consistency between different prompt-photon experiments and on 
the issue of intrinsic- (/c^) effects in hadron-hadron collisions are necessary. Owing to the 
present theoretical (and, possibly, experimental) uncertainties, it is difficult to use prompt- 
photon data to accurately determine the gluon density at large x. Other recent theoretical 
improvements, such as soft-gluon resummation, of the perturbative calculations for prompt- 
photon production at large xt = 1EtI^~S are discussed in Sect. |. 

Studies of other single-particle inclusive cross sections, such as 7r° cross sections ^ 



38 1, can be valuable to constrain the parton densities and could possibly help to clarify some 



of the experimental and theoretical issues arisen by prompt-photon production. 



4 Partonic cross sections: fixed-order expansions, 
resummed calculations, parton showers 

The calculation of hard-scattering cross sections according to the factorization formula @) 
requires the knowledge of the partonic cross sections o", besides that of the parton densities. 
The partonic cross sections are usually computed by truncating their perturbative expansion 
at a fixed order in as: 

^(Pi,P2; g, {Qi, . . .}; /i|) = cc\{ix\) {a(^o)(Pi,P2; Q, {Qi, • • •}) (9) 

+ as(/x?j) o^'''''^\vx.V2\ Q, {Qi, . . .}; /i|) 
+ «i(/z^) a(^^^«)(pi,p2; g, {Qi, . . .}; /x^) + . . .} . 

generically called the (fcj.) 'from initial-state soft-gluon radiation'. 

^The Et distribution of the single-photon is not calculable down to Et — or, in other words, da/dET 
is not integrable in the entire kinematic range of Et- Thus, the intrinsic {k\^ of the incoming partons does 
not simply produce a shift of events from the 1ow-_Bt to the high-i?T region. For this reason, the terminology 
'{k±) kick' seems to be more appropriate than '(/c±) smearing'. 
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Figure 9: A comparison between NLO QCD calculations and data from the ISR and fixed-target 
experiments for the prompt-photon distribution da/dEx {xt = 2Et/VS)- 



The scale is the arbitrary renormahzation scale introduced to define the perturbative 
expansion. Although the 'exact' partonic cross section on the left-hand side of Eq. does 
not depend on /x^, each term on the right-hand side (and, hence, any fixed-order truncation) 
separately depends on it. 

The LO (or tree-level) term a^^'^^ gives only an estimate of the order of magnitude of the 
partonic cross section, because at this order as is not unambiguously defined. Equivalently, 
we can say that since o"*-^*^^ does not depend on /i^, the size of its contribution can be varied 
quite arbitrarily by changing /i/j in its coefficient as(/^R)- '^^^ strong coupling as can be 
precisely defined only starting from NLO. A 'reliable' estimate of the central value of a thus 
requires the knowledge of (at least) the NLO term a^^^^\ This term explicitly depends on 
liR and this dependence begins to compensate that of as(/x^). 

In general, the n-th term in the curly bracket of Eq. (^) contains contributions of the type 
(as(/i|.) InQZ/i^)". If /i^ is very different from the hard scale Q, these contributions become 
large and spoil the reliability of the truncated expansion (P). Thus, in practical applications 
the scale iir should be set approximately equal to the hard scale Q. As mentioned in 
Sect. D, variations of hr around this central value are typically used to set a lower limit on 
the theoretical uncertainty of the perturbative calculation. 

A better estimate of the accuracy of any perturbative expansion is obtained by considering 
the effect of removing the last perturbative term that has been computed. Since as can be 
precisely defined only at NLO, this procedure can consistently be applied to Eq. (|) only as 
from its NNLO term. A 'reliable' estimate of the theoretical error on a thus requires the 
knowledge of the NNLO term a^^^^o) 
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The LO and NLO approximations of a are used at present in (most of) the fixed-order 
QCD calculations. Prospects towards NNLO calculations of partonic cross sections and AP 
splitting functions are reviewed in Refs. |0, [39[] . 



The fixed-order expansion @ provides us with a well-defined and systematic framework 
to compute the partonic cross section (3"(pi,p2; Q, {<5i, • • •}; /^f) of any hard-scattering process 
that is sufficiently inclusive or, more precisely, that is defined in an infrared- and coUinear- 
safe manner. However, the fixed-order expansion is reliable only when all the kinematical 
scales Q, {Qi, . . .} are of the same order of magnitude. When the hard-scattering process 
involves two (or several) very different scales, say Q\ 3> Q, the N"LO term in Eq. (P) can 
contain double- and single-logarithmic contributions of the type (as-^^)"" and (as^^)" with 
L = ln{Qi/Q) ^ 1. These terms spoil the reliability of the fixed-order expansion and have 
to be summed to all orders by systematically defining logarithmic expansions (resummed 
calculations). 

Typical large logarithms, L = In Q/Qq, are those related to the evolution of the par- 
ton densities from a low input scale Qq to the hard-scattering scale Q. These logarithms 
are produced by collinear radiation from the colliding partons and give single-logarithmic 
contributions. They never explicitly appear in the calculation of the partonic cross section, 
because they are systematically (LO, NLO and so forth) resummed in the evolved parton 
densities f{x,Q^) by using the DGLAP equation (H). 

Different large logarithms, L = In Q/ v^, appear when the centre-of-mass energy v^of 
the collision is much larger than the hard scale Q. These small-x {x = Q/\fS) logarithms 
are produced by multiple radiation over the wide rapidity range that is available at large 
energy. They usually give single-logarithmic contributions that can be resummed by using 
the BFKL equation. BFKL resummation is relevant to DIS structure functions at small 
values of the Bjorken variable x (see Sect. ^ and it can also be important at the LHC for 
the production of h quarks and of prompt photons at relatively low Et- 

Another class of large logarithms is associated to the bremsstrahlung spectrum of soft 
gluons. Since soft gluons can be radiated collinearly, they give rise to double-logarithmic 
contributions to the partonic cross section: 

a ~ |l + £ L^^ + C^£.,L^-' + cS-.L'^' + •.•)} • (10) 

Soft-gluon resummation is discussed in Sect. |^. 

A related approach to evaluate higher-order contributions to the partonic cross sections 
is based on Monte Carlo parton showers (see |^0[ and the updated list of references in 



Rather than computing exactly a^^^*^), Q-i^^LO) fQj^^j^^ the parton shower gives 

an all-order approximation of the partonic cross section in the soft and collinear regions. In 
this respect, the computation of the partonic cross sections performed by parton showers is 
somehow similar to that obtained by soft-gluon resummed calculations. There is, however, an 
important conceptual difference between the two approaches. This difference and the limits 
of applicability of the parton-shower method are briefly recalled below. Apart from these 
limits, parton-shower calculations can give some advantages. Multiparton kinematics can be 
treated exactly. The parton shower can be supplemented with models of non-perturbative 
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effects (hadronization, intrinsic k±, soft underlying event) to provide a complete description 
of the hard-scattering process at the hadron level. 

For a given cross section, resummed calculations can in principle be performed to any 
logarithmic accuracy. The logarithmic accuracy achievable by parton showers is instead in- 
trinsically limited by quantum mechanics. The parton-shower algorithms are probabilistic. 
Starting from the LO cross section, the parton shower generates multiparton final states 
according to a probability distribution that approximates the square of the QCD matrix 
elements. The approximation is based on the universal (process-independent) factorization 
properties of multiparton matrix elements in the soft and collinear limits. Although the 
matrix element does factorize, its square contains quantum interferences, which are not 
positive-definite and, in general, cannot be used to define probability distributions. To lead- 
ing infrared accuracy, this problem is overcome by exploiting QCD coherence (see Refs. |^ 



4T| , |42[| and referencees therein) : soft gluons radiated at large angle from the partons involved 
in the LO subprocess destructively interfere. This quantum mechanical effect can be simply 
implemented by enforcing an angular-ordering constraint on the phase space available for the 
parton shower evolution. Thus, angular-ordered parton showers can consistently compute 
the first two dominant towers (agL^" and agL^"~^) of logarithmic contributions in Eq. (p^O]). 
However, parton showers contain also some subleading logarithmic contributions. For in- 
stance, they correctly compute the single-logarithmic terms OgL" of purely collinear origin 



that lead to the LO evolution of the parton densities. Moreover, as discussed in Ref. 



by a comparison with resummed calculations, in the case of hard-scattering processes whose 
LO subprocess involves two coloured partons (e.g. DIS or DY production), angular-ordered 
parton showers have a higher logarithmic accuracy: they can consistently evaluate the LL 
and NLL terms in Eq. (pIS)). The extension of parton-shower algorithms to higher logarithmic 
accuracy is not necessarily feasible and is, in any case, challenging. 

Of course, because of quantum interferences and quantum fiuctuations, the probabilistic 
parton-shower approach cannot be used to systematically perform exact calculations at NLO, 
NNLO and so forth. Nonetheless, important progress has been made to include matrix 
element corrections in parton shower algorithms p4|-^. The purpose is to consider the 
multiparton configurations generated by parton showering from the LO matrix element and 
to correct them in the hard (non-soft and non-coUinear) region by using the exact expressions 
of the higher-order matrix elements. Hard matrix element corrections to parton showers have 
been implemented for top quark decay and for production of W, Z and DY lepton pairs 
p9| , |50| , The same techniques could be applied to other processes, as, for instance, 

production of Higgs boson and vector-boson pairs . 

Note also that, at present, angular-ordered parton showers cannot be considered as true 
'next-to-leading' tools, even where their logarithmic accuracy is concerned. The consistent 
computation of the first two towers of logarithmic contributions in Eq. ( p!oD is not sufficient 
for this purpose. For instance, to precisely introduce an NLO definition of as, we should 
control all the terms obtained by the replacement as — > as + c ag + 0(ag). When it is 
introduced in the towers of double- logarithmic terms agL^" of Eq. (|TU]), this replacement 
leads to contributions of the type ag'^'^L^" ~ agL^""^. Since these contributions are not 
fully computable at present, the parameter as used in the parton showers corresponds to a 
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simple LO parametrization of QCD running coupling. 



5 Soft-gluon resummation 

Double-logarithmic contributions due to soft gluons arise in all the kinematic configurations 



where radiation of real and virtual partons is highly unbalanced (see Ref. |^3| and references 
therein). For instance, this happens in the case of transverse- momentum distributions at 
low transverse momentum, in the case of hard-scattering production near threshold or when 
the structure of the final state is investigated with high resolution (internal jet structure, 
shape variables). 

Soft-gluon resummation for jet shapes has been extensively studied and applied to hadronic 
final states produced by e^e~ annihilation 54]. Applications to hadron-hadron col- 



lisions have just begun to appear and have a large, yet uncovered, potential (from as 
determinations to studies of non-perturbative dynamics). 

Transverse-momentum logarithms, L = InQ^/Q^, occur in the distribution of transverse 
momentum of systems with high mass Q {Q ^ Q±) that are produced with a vanishing 
Qj^ in the LO subprocess. Examples of such systems are DY lepton pairs, lepton pairs 
produced by W and Z decay, heavy quark-ant iquark pairs, photon pairs and Higgs bosons. 
In these processes the LO transverse-momentum distribution is sharply peaked around Qj^ = 
{da/(PQ^ oc 5'^'^\Q±))- If the heavy system is produced with Q\ <^ Q^, the emission 
of real radiation at higher orders is strongly suppressed and cannot balance the virtual 
contributions. The ensuing logarithms, L = InQ^/Q^, diverge order by order when — >■ 0, 
but after all-order resummation they leads to a finite smearing of the LO distribution. 

Threshold logarithms, L = ln(l — x), occur when the tagged final state produced by the 
hard scattering is forced to carry a very large fraction x {x —>■ 1) of the available centre-of- 
mass energy y/S. Also in this case, the radiative tail of real emission is stronly suppressed 
at higher perturbative orders. Oustanding examples of hard processes near threshold are 
DIS at large x (here x is the Bjorken variable), production of DY lepton pairs with large 
invariant mass Q {x = Q/y/S), production of heavy quark-antiquark pairs (x = 2mQ/\/S), 
production of single jets and single photons at large transverse energy Et (x = 2Et/^). 

To emphasize the difference between transverse-momentum logarithms and threshold log- 
arithms generated by soft gluons, it can be instructive to consider prompt-photon production. 
In the case of production of a photon pai'^ with invariant mass squared = [p^^^ + p'^^Y 
and total transverse momentum Qj^ = p\^}_ +P2±5 transverse-momentum logarithms and 
threshold logarithms appear when Q\ <^ and Q\ ~ (S/A — Q"^), respectively. How- 
ever, in the case of production of a single photon with transverse energy (or, equivalently, 
transverse momentum) E^, soft gluons can produce logarithms only in the threshold region 
xt = 2Et/\/S — > 1. If the prompt photon has a transverse energy that is not close| to its 
threshold value, the emission of accompanying radiation is not kinematically suppressed and 

^The same discussion applies to the production of a DY lepton pair. 

^Eventually, when *C 1, higher-order corrections are single-logarithmically enhanced. This small- a; 
logarithms, (as Iuzt)", have to be taken into account by BFKL resummation. 
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there are no soft logarithms analogous to those in the transverse-momentum distribution 
of a photon pair. In particular, there are no double-logarithmic contributions of the type 
(as In E^/S)"', and perturbative soft gluons are not distinguishable from perturbative hard 
gluons. 

Studies of soft-gluon resummation for transverse-momentum distributions at low trans- 
verse momentum and for hard-scattering production near threshold started two decades ago 
4^ , The physical bases for a systematic all-order summation of the soft-gluon contri- 



butions are dynamics and kinematics factorizations [Q. The first factorization follows from 
gauge invariance and unitarity: in the soft limit, multigluon amplitudes fulfil factorization 
formulae given in terms of universal (process-independent) soft contributions. The second 
factorization regards kinematics and strongly depends on the actual cross section to be eval- 
uated. //, in the appropriate soft limit, the multiparton phase space for this cross section 
can be written in a factorized way, resummation is analytically feasible in form of generalized 
exponentiation of the universal soft contributions that appear in the factorization formulae 
of the QCD amplitudes. 

Note that the phase space depends in a non-trivial way on multigluon configurations 
and, in general, is not factorizable in single-particle contributions^. Moreover, even when 
phase-space factorization is achievable, it does not always occur in the space of the kinematic 
variables where the cross section is defined. Usually, it is necessary to introduce a conju- 
gate space to overcome phase-space constraints. This is the case for transverse-momentum 
distributions and hard-scattering production near threshold. The relevant kinematical con- 
straint for Qj^-distributions is (two-dimensional) transverse-momentum conservation and it 
can be factorized by performing a Fourier transformation. Soft-gluon resummation for Qj^- 
distributions is thus carried out in 6-space pOl , |61|| , where the impact parameter b is the 



variable conjugate to Qj^ via the Fourier transformation. Analogously, the relevant kine- 
matical constraint for hard-scattering production near threshold is (one-dimensional) energy 
conservation and it can be factorized by working in A^-moment space []62| , |63| , being 
the variable conjugate to the threshold variable x (energy fraction) via a Mellin or Laplace 
transformation. 

Using a short-hand notation, the general structure of the partonic cross section a after 
summation of soft-gluon contributions is 

O" = (Trcs. + ^rem. ■ (H) 

The term (Jres. embodies the all-order resummation, while the remainder (Trem. contains no 
large logarithmic contributions. The latter has the form 

<Trem. = <T(^-°-)-[ares.]^'°-^ , (12) 

and it is obtained from ° \ the truncation of the perturbative expansion for a at a given 
fixed order (LO, NLO, ...), by subtracting the corresponding truncation [(Jres.]*'^'°''' of the 

^In the case of jet cross sections, for instance, phase-space factorization depends on the detailed definition 
of jets and it can easily be violated Q. Some jet algorithms, such as the fc_L-algorithm |5^, have better 
factorization properties. 
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resummed part. Thus, the expression on the right-hand side of Eq. ( pAj ) includes soft-gluon 
logarithms to all orders and it is matched to the exact (with no logarithmic approximation) 
fixed-order calculation. It represents an improved perturbative calculation that is everywhere 
as good as the fixed-order result, and much better in the kinematics regions where the 
soft-gluon logarithms become large {asL ~ 1). Eventually, when asL^l, the resummed 
perturbative contributions are of the same size as the non-perturbative contributions and 
the effect of the latter has to be implemented in the resummed calculation. 
The resummed cross section has the following typical form: 

ares. = 4 [ -c-s, (13) 

J inv. 

where the integral /j^^ denotes the inverse tranformation from the conjugate space where 
resummation is actually carried out. Methods to perform the inverse transformation are 
discussed in Refs. and for Q^-resummation and threshold resummation, respectively. 
The C term has the perturbative expansion 

C= 1 + Ci«s + C2«i + ... (14) 

and contains all the constant contributions in the limit L — oo (the coefficients Ci, C2, ■ ■ ■ 
do not depend on the conjugate variable). The singular dependence on L (more precisely, 
on the logarithm L of the conjugate variable) is entirely exponentiated in the factor S: 

S = exp {L gi{asL) + g2{asL) + as gsiasL) + . . .} . (15) 

In the exponent, the function L gi resums all the leading logarithmic (LL) contributions 
ttgL""*"^, while g2 contains the next-to-leading logarithmic (NLL) terms OgL" and so forthQ 
(all the functions gi are normalized as ^'^(A = 0) =0). Note that the LL terms are formally 
suppressed by a power of as with respect to the NLL terms, and so forth for the successive 
classes of logarithmic terms. Thus, this logarithmic expansion is as systematic as the fixed- 
order expansion in Eq. (P). In particular, using a matched NLL+NLO calculation, we 
can consistently i) introduce a precise definition (say MS) of as(/i) and ii) investigate the 
theoretical accuracy of the calculation by studying its renormalization-scale dependence. 

The structure of the exponentiated resummed calculations discussed so far has to be 
contrasted with that obtained by organizing the logarithmic expansion on the right-hand 
side of Eq. (^) in terms of towers as 

a ^ a'^a^^^^ {t,{asL^) + asL t2{asL^) + alL^ t3{asL^) + . . .} , (16) 

where the double-logarithmic function ti(as-^^) and the successive functions are normalized 
as ti{0) = const. While the ratio of two successive terms in the exponent of Eq. ([T5|) is 

^°To compare this notation with that of Ref. we can notice that our functions Qj are obtained by 
the straightforward integration over JI of the functions A{as(jl)) and B{as{]l)) of Ref. |Q. In particular, 
our terms gi,g2,g3 are not to be confused with the non-perturbative parameters of the same name used in 
Ref. 
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formally of the order of as, the ratio of two successive towers in Eq. ( p!6[ ) is formally of 
the order of a^L. In other words, the tower expansion sums the double-logarithmic terms 
(as-Z^^)", then the terms agL^"""*^ ~ asL{aQ,L'^Y~^ , and so forth; it thus assumes that the 
resummation procedure is carried out with respect to the large parameter a^L'^ (as-^^ ^ !)• 
On the contrary, in Eq. (^) the large parameter is a^L^l. The tower expansion allows 
us to formally extend the applicability of perturbative QCD to the region ^/a^, and 

exponentiation extends it to the wider region L ^ 1 /as • This fact can also be argued by 
comparing the amount of information on the logarithmic terms that is included in the trun- 
cation of Eqs. ([ISD and ([16| ) at some logarithmic accuracy. The reader can easily check 
that, after matching to the NLO (LO) calculation as in Eq. (pT]), the NLL (LL) result of 
Eq. (|15D contains all the logarithms of the first four {two) towers in Eq. (|16D (and many 
more logarithmic terms). 

In the case of Q^-distributions, full NLL resummation has been performed for lepton 
pairs, W and Z bosons produced by the DY mechanism |^T], ^ and for Higgs bosons pro- 
duced by gluon fusion [^. Corresponding resummed calculations are discussed in Refs. 
SB| and references therein. 

Threshold logarithms in hadron collisions have been resummed to NLL accuracy for DIS 
and DY production [|62i 33, 43, 69] and for Higgs boson production |7^. Recent theoretical 
progress ||7l], |7^, regards the extension of NLL resummation to processes produced by LO 
hard-scattering of more than two coloured partons, such as heavy-quark hadropro duct ion 
7T| , [19| and leptoproduction [Q, as well as prompt-photon |[7^-|77|, quarkonium |]78|] and 



vector-boson |^ production. 

An important feature of threshold resummation is that the resummed soft-gluon contri- 
butions regard the partonic cross section rather than the hadronic cross section. This fact has 
two main consequences: i) soft-gluon contributions can be sizeable long before the threshold 
region in the hadronic cross section is actually approached, and ii) the resummation effects 
typically enhance the fixed-order perturbative calculations. 

The first consequence follows from the fact that the evolution of the parton densities 
sizeably reduces the energy that is available in the partonic hard-scattering subprocess. 
Thus, the partonic cross section a in the factorization formula is typically evaluated much 
closer to threshold than the hadronic cross section. In other words, the parton densities are 
strongly suppressed at large x (typically, when x 1, f{x,iJ?) ~ {1 — x^ with 77 ~ 3 
and ?7 ~ 6 for valence quarks and sea-quarks or gluons, respectively); after integration over 
them, the dominant value of the square of the partonic centre-of-mass energy (s) = {xiX2)S 
is therefore substantially smaller than the corresponding hadronic value S. 

The second consequence, which depends on the actual definition of the parton densities, 
follows from the fact that the resummed contributions are those soft-gluon effects that are left 
at the partonic level after factorization of the parton densities. After having absorbed part 
of the full soft-gluon contributions in the customary definitions (for instance, those in the 
MS or DIS factorization schemes) of the parton densities, it turns out that the residual effect 
in the partonic cross section is positive and tends to enhance the perturbative predictions. 

A quantitative illustration of these consequences is given below by discussing top-quark 
and prompt-photon production. The discussion also shows another relevant feature of 
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Figure 10: The tt production cross section in collisions as a function of v^. The solid lines 
represent the NLO results for different choices (/i = and pi, = 2mt) of the renormaliza- 

tion/factorization scale /i = /i^ = fip, normalized to the result with /i = mj. The dashed lines 
represent the NLO+NLL results for different choices of /i (/i = mt/2,mt and 2mt), normalized 
to the NLO result with /i = rrit. 



NLO+NLL calculations, namely, their increased stability with respect to scale variations. 
The effects of soft-gluon resummation on the top-quark production cross sections at 



hadron colliders have been studied in Refs. |19, p0|-|83||. In the case of pp collisions, the 



comparison between QCD predictions at NLO and those after NLL resummation is shown 



in Fig. |T0| [0. At the Tevatron the resummation effects are not very large and the NLO 
cross section is increased by about 4%. This had to be expected because the top quark is not 
produced very close to threshold (x = 2mt/ VS ~ 0.2, at the Tevatron). Note, however, that 
the dependence on the factorization/renormalization scale of the theoretical cross section is 
reduced by a factor of almost 2 by including NLL resummation. More precisely, the scale 
dependence (~ ±5%) of the NLO+NLL calculation becomes comparable to that obtained 
by using different sets of parton densities []TT|. Combining linearly scale and parton density 



uncertainties, the NLO+NLL cross section is au = 5.0 ± 0.6, with = 175 GeV and 
v^= 1.8 TeV ig. 

At the LHC (x = 2mt/VS ~ 0.03) the top quark is produced less close to the hadronic 
threshold than at the Tevatron. However this is compensated by the fact that the gluon 
channel^ is more important at the LHC. As a result, the effect of including soft-gluon 
resummation to NLL accuracy is very similar: the NLO cross section is enhanced by ~ 
5% and its scale dependence is reduced from ~ ±10% to ~ ±5%. Note, however, that 

^^Since fg is steeper than fq at large x, partonic cross sections in gluon subprocesses are typically closer to 
threshold than in quark subprocesses. Moreover, the intensity of soft-gluon radiation from gluons is larger 
than that from quarks by a factor of ~ Ca/Cf ~ 2. 



24 




Figure 11: The dependence on the factorization (fxp) and renormalization (/x^ = fi) scale of the 
prompt-photon cross section da/dET in pN collisions at Et = 10 GeV and = 31.6 GeV. 
The short-dashed, long-dashed and solid lines are respectively the results at LO, NLO and after 
NLO-I-NLL resummation. 



the uncertainty (~ ±10%) coming from the parton (gluon) densities is larger than at the 
Tevatron [0. 

Similar qualitative results are obtained ||76|] when NLL resummation is applied to prompt- 
photon production at fixed-target experiments. The scale dependence of the theoretical 
calculation is highly reduced and the resummed NLL contributions lead to large corrections 
at high xt = 2Et/VS (and smaller corrections at lower xt)- Of course, the impact of 
soft-gluon resummation is quantitatively more sizeable in prompt-photon production than 
in top-quark production, because xt can be as large as 0.6, the hard scale Et is much 
smaller than rrit (thus, as{ET) > as('^t)) and the gluon channel is always important. The 



scale dependence of the theoretical cross section for the E706 kinematics is shown in Fig. ^ 



Fixing hp> = jj^F = jJ' and varying jd in the range Et/2 < jj, < 2Et with Et = 10 GeV, the 
cross section varies by a factor of ~ 6 at LO, by a factor of ~ 4 at NLO and by a factor of 
~ 1.3 after NLL resummation. The highly reduced scale dependence of the NLO+NLL cross 



section is also visible in Fig. [1^, which, in particular, shows that when Et = 10 GeV and 



-E'beam = 530 GeV the central value (i.e. with /x = Et) of the NLO cross section increases by 
a factor of ~ 2.5 after NLL resummation. As expected, the size of these effects is reduced 
by increasing \/S at fixed Et (see Fig. or by decreasing Et at fixed (see Fig. |^). 

The comparison with the E706 data shown in Fig. |13| suggests that the NLO+NLL 
calculation can help to better understand prompt-photon production at large xt- Note, 
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Figure 12: The prompt-photon cross section da/dEx in pN collisions at Et = 10 GeV as a 
function of the energy -Bbeam of the proton beam. The solid lines represent the NLO results for 
different choices (/x = Et/2 and // = 2Et) of the renormalization/factorization scale fj, = = 
Hp, normalized to the result with ^ = E^- The dashed lines represent the NLO+NLL results for 
different choices of jj, {jj, = Et/2, Et and 2Et), normalized to the NLO result with jj, = Et- 
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Figure 13: E706 prompt-photon data compared with theoretical calculations, which use the 
parton densities of the set CTEQ4 and GRV photon fragmentation functions. The solid and 
dashed lines correspond to the NLO+NLL and pure NLO calculations, respectively. 



however, that this comparison has to be regarded as prehminary in several respects In 
particular, the parton densities used in Fig. |13| are those extracted from NLO fits. Owing 
to the soft-gluon enhancement at large xt, refitting the parton densities may lead to a 
smaller fg at large x and, consequently (because of the momentum sum rule), a larger fg 
at intermediate x. As a result, this procedure could somehow increase the theoretical cross 
section also at smaller values of xt- 

Soft-gluon resummation at NLL accuracy is now available for all the processes (namely, 
DIS, DY and prompt-photon production) that are typically used to perform global fits to 
parton densities. A detailed extraction/evolution of parton densities by consistently using 
NLL resummed calculations is thus nowadays feasible. 



6 Other topics 

The activity of the QCD Working Group at this Workshop has also been devoted to other 
topics, such as automatic computation of matrix elements and LO cross sections for multipar- 
ticle processes at high-energy colliders, definition and properties of jets algorithms, definition 
of isolated photons and related NLO calculations. Corresponding contributions are included 
in these Proceedings. 

Other studies performed during this Workshop have a large overlap with the activity 
of the related Workshops at FERMILAB and CERN and can be found in those Proceed- 
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Partons for the LHC 



R.D. Ball Q and J. Huston 



Abstract 

We discuss some of the experimental, theoretical and methodological issues in the 
determination of parton distributions with meaningful error estimates, and their impact 
on physical cross sections to be measured at the Tevatron and LHC. 

1 Introduction 

The calculation of production cross sections at the Tevatron and LHC, for both interesting 
physics processes and their backgrounds, relies upon a knowledge of the distribution of 
the momentum fraction x of the partons in a proton at the relevant scale. These parton 
distribution functions (pdfs) are at present determined by global fits to data from deep 
inelastic scattering (DIS), Drell-Yan (DY), and jet and direct photon production at current 
energy ranges. Two groups, CTEQ and MRS, provide semi-regular updates to their best-fit 
parton distributions when new data and/or theoretical developments become available. The 
newest pdfs, in most cases, currently provide the single most accurate overall description of 
the world's data, and should be utlilized in preference to older pdf sets. The most recent 
sets from the two groups are CTEQ5 and MRST . 

In this contribution we will discuss the data sets used in the fits, the way in which the 
fits are performed in practice (in particular, issues such as the parametrization of initial dis- 
tributions, the solution of the evolution equations, and scheme dependence), and the main 
uncertainties in the fitted pdfs due to uncertain or incomplete experimental data. In par- 
ticular, we will concentrate on the difficulties involved in determining the gluon distribution 
through direct photons or jets. We then move on to discuss more general issues which may 
affect future pdf determinations: the inclusion of correlated systematics and the difficulties 
involved in combining these for different experiments, purely theoretical uncertainties aris- 
ing from the limitations of NLO perturbative QCD, and finally, methodological uncertainties 
such as the dependence on the form of the parametrization and the assumption of Gaussian 
error propagation. We conclude with a summary of the progress that might be made before 
the LHC turns on, and the role of LHC data in determining pdfs. 

^^Royal Society University Research Fellow. 
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2 Processes Involved in Global Analysis Fits 



Lepton-lepton, lepton-hadron and hadron-hadron interactions probe complementary aspects 
of perturbative QCD (pQCD). Lepton-lepton processes provide clean measurements of asiQ"^) 
and of the fragmentation functions of partons into hadrons. Measurements of deep-inelastic 
scattering (DIS) structure functions (^2,^3) in lepton-hadron scattering and of lepton pair 
production cross sections in hadron-hadron collisions are the main source of information on 
the quark distributions Q) inside hadrons. Scaling violations in deep inelastic processes 
give some information about the gluon distribution g{x,Q). Furthermore the gluon distri- 
bution function enters directly (i.e. at leading order) in hadron-hadron scattering processes 
with direct photon and jet final states. Modern global parton distribution fits are carried 
out to next-to-leading (NLO) order which allows as{Q'^),q"'{x,Q) and g{x,Q) to all mix 
and contribute in the theoretical formulae for all processes. Nevertheless, the broad picture 
described above still holds to some degree in global pdf analyses. 

In pQCD, the gluon distribution is always accompanied by a factor of as, in both the 
hard scattering cross sections and in the evolution equations for parton distributions. Thus, 
determination of and the gluon distribution is, in general, a strongly coupled problem. 
One can determine separately from e~^e~ or determine and g{x, Q) jointly in a global 
pdf analysis. In the latter case, though, the coupling of and the gluon distribution may 
not lead to a unique solution for either (see for example the discussion in the CTEQ4 paper 
where good fits were obtained to a global analysis data set, including the inclusive jet data, 
for a wide range of values 0.) 

Currently, the world average value of as{Mz) is 0.119 ± 0.004 0. This is in agreement 
with the average value from LEP, while the DIS experiments prefer a slightly smaller value 
(of the order of 0.116 — 0.118)). Since global pdf analyses are dominated by the high statistics 
DIS data, they would tend to favor the values of as closer to the lower DIS values. The 
more logical approach is to adopt the world average value of as{Mz) and concentrate on the 
determination of the pdfs. This is what both CTEQ and MRS currently do. 

The data from DIS, DY, direct photon and jet processes utilized in pdf fits cover a wide 
range in x and Q = \fQ^ . The kinematic 'map' in the (x, Q) plane of the data points 
used in a recent parton distribution function analyses is shown in Figure |I[ The HERA 
data (Hl+ZEUS) are predominantly at low x, while the fixed target DIS and DY data are 
at higher x. There is considerable overlap, however, with the degree of overlap increasing 
with time as the statistics of the HERA experiments increases. DGLAP-based NLO pQCD 
provides an accurate description of the data (and of the evolution of the parton distributions) 
over the entire kinematic range shown. At very low x and Q^, DGLAP evolution is believed 
to be no longer applicable due to unresummed small x logarithms. Similarly at very large x 
there are significant contributions from unresummed soft logarithms (logarithms of 1 — x). 
However, no evidence for such corrections is seen in the current range of data; thus all global 
analyses use conventional DGLAP evolution of pdfs. 

^•^One can either quote a value of as{Mz) or the value of A*^^. In the latter case, however, the number 
of flavors has to be clearly specified, since the value of (and not A^^"^) has to be continuous across flavor 
thresholds. 
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Figure 1: The kinematic map in the {x,Q) plane of data points used in the CTEQ5 analysis. 

There is a remarkable consistency between the data in the pdf fits and the NLO QCD 
theory fit to them. Over 1300 data points are shown in Figure |I] and the x^/d-o.f. for the 
fit of theory to data is on the order of one. 

Parton distributions determined at a given x and propagate down to lower x values 
at higher values. The accuracy of the extrapolation to higher depends both on the 
accuracy of the original measurement and any uncertainty on asiQ"^)- For the structure 
function F2, the typical measurement uncertainty at medium to large x is on the order of 
±3%. At large x, the DGLAP equation for F2 can be approximated as q^^^q2 = as{Q'^)P'^'^® 
F2. There is an extrapolation uncertainty of around ±5% in F2 from low to high (10^ 
GeV"^) from the uncertainty in a^. Evolved distributions are also susceptible to uncertainties 
from an anomalously large contribution to F2 near x values of 1. Such a contribution may 
not be evident in fixed target measurements at low x and low Q"^, but may influence higher 

measurements 0. 

For comparison, the kinematics appropriate for the production of a state of mass M and 
rapidity y at the LHC is shown in Figure ^ |^. For example, to produce a state of mass 100 
GeV and rapidity 2 requires partons with x values between 0.05 and 0.001 at a value of 10^ 
GeV^. Also shown in the figure is another view of the kinematic coverage of the fixed target 
and HERA experiments used in pdf fits. It can be seen that parton distributions determined 
from these experiments are sufficient to predict most LHC cross-sections of interest, provided 
that DGLAP evolution at small and large x is sufficiently reliable. 
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LHC parton kinematics 




Figure 2: A plot of LHC parton kinematics in (x, Q"^) space. Also shown are the reach of fixed 
target and HERA experiments. 
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3 Evolution, Schemes and Parametrizations 



3.1 Evolution Codes 

In order to fit the initial pdfs to experimental data they need to be evolved up to the correct 
scale by solving the DGLAP equations either to LO or NLO. The evolution can be carried 
out in either moment space or configuration space: both MRS and CTEQ use configuration 
space codes. Improvements have been made in the CTEQ and MRST evolution programs 
so that both now agree with the 'DESY standard' evolution prescription The CTEQ 
and MRST packages should be able to carry out the evolution using NLO DGLAP to an 
accuracy of a few percent over the LHC kinematic range, except perhaps at very large and 
very small x. Note that the theoretical predictions for the W and Z total cross sections 
at the LHC may have uncertainties of less than 5%[§]. This puts a great demand for the 
pdf evolution to have accuracies of better than a few percent, since any error on a pdf gets 
doubled in the cross section calculation. Mellin space codes might be the answer here. 

A global pdf analysis carried out at next-to-leading order needs to be performed in a 
specific renormalization and factorization scheme. The evolution kernels are in a specific 
scheme and to maintain consistency, any hard scattering cross section calculations used for 
the input processes or utilizing the resulting pdfs need to also have been implemented in 
that same renormalization scheme. Almost universally, the MS scheme is used: pdfs are 
also available in the DIS scheme, a fixed fiavor scheme (as in ref. fpfl) and several schemes 



that differ in their specific treatment of the charm quark mass ||T0| , |Tl| . 

It is also possible to use only leading-order matrix element calculations in the global fits 
which results in leading-order parton distribution functions. Such pdfs are preferred when 
leading order matrix element calculations (such as Monte Carlo programs like HERWIG [|T^ 
and PYTHIA |]13|) are used. The differences between LO and NLO pdfs, though, are formally 
NLO; thus, the additional error introduced by using a NLO pdf with HERWIG rather than a 
LO pdf, for example, should not be significant, in principle, and NLO pdfs can be used when 
no LO alternatives are available. The accuracy of current DIS/DY data is such that the 
values for LO fits are noticeably worse than those from the NLO fits: the data are sensitive 
to the differences between LO and NLO partonic cross-sections and evolution kernels. 



3.2 Parametrization of Initial Distributions 

All current global analyses use a generic form for the parametrization of both the quark and 
gluon distributions at some reference value Qq: 

fix, Qo) = aox^^ (1 - xrP{x; as, ...). (1) 

The reference value Qo is usually chosen in the range of 1 — 2 GeV. The parameter ai is 
associated with small-x behaviour while 02 is associated with large-a; valence counting rules. 
In some pdf fits, af has been tied to q^^^^'^^'^^-^ more recent fits like CTEQ4, CTEQ5 and 
MRST, the two small x exponents are allowed to vary independently. The current statistical 
power of the low x and DIS data from HERA warrants this separation. 
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The first two factors, in general, are not sufficient to describe either quark or gluon 
distributions. The term P{x; 03, ...) is a suitably chosen smooth function, depending on one 
or more parameters, that adds more fiexibihty to the pdf parametrization. In general, both 
the number of free parameters and the functional form can have an influence on the global 
fit. For example, the MRS group traditionally uses Pmrs{x; 03, 04) = 1 + asy/x + a^x. The 
CTEQ3 pdf used Pcteqz = 1 + 030; while CTEQ2, CTEQ4 and CTEQ5 all use the more 
general form Pcteq2,4,5 = 1 + a^x""^. The flexibility in the latter form, for example, makes 
possible the larger gluon at high x observed in the CTEQ4HJ pdf. 

Although the pdfs determined from global analyses should, in principle, be universal, in 
practice they could depend on the choice of data sets, and in particular on the choice of Qcut 
values that specify the minimum hard physical scale {Q,Pt, ••) required for data points to 
be included in the fit. 




Figure 3: The parton distributions from the CTEQ5 set plotted at a Q value of 5 GeV. 

The parton distributions from the recent CTEQ pdf release are plotted in Figure ^ at 
a Q value of 5 GeV. The gluon distribution is largest at small x values while the valence 
quark distributions dominate at higher x. 

3.3 Evolution in time and 

As discussed in the introduction, the MRS and CTEQ groups provide semi-regular updates 
to their parton distributions as new data and/or theory becomes available. The latest 
parton distributions are the most accurate and should be used in preference to previous 
pdfs. However, in some cases calculations using older pdfs are necessary: for example, until 
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recently [^none of the more recent pdfs were implemented in PYTHIA, and most comparisons 
in the ATLAS TDR have been made with the CTEQ2L pdf (the default pdf in PYTHIA 
version 5.7). 

A comparison of the CTEQIM [|T4|, CTEQ2M P], CTEQ3M ^ and CTEQ4M § 



parton distributions (in particular the up sea quark and gluon distributions) are shown in 
Figure |, at a value of 5 GeV^. The CTEQ2-4 up quark sea distributions are substantially 
steeper than that of CTEQl, reflecting the influence of the HERA data. A similar effect is 
seen with the gluon distribution. There is little change in the valence distributions. 

The up sea quark and gluon distributions are shown in Figure |^ at a larger value of 
10'^ GeV^. Evolution has evened out many of the differences observed at lower Q"^ values. A 

value of 10'' GeV^ corresponds to a mass scale at the LHC of about 100 GeV. 

The effects of evolution are examined in more detail in Figure ^ where the up sea quark 
and gluon distributions are plotted at values of 2, 10, 50, 10^ and 10^ GeV^. There are 
two interesting features that can be noted. Most of the evolution takes place at low and 
there is little evolution for x values in the vicinity of 0.1. In contrast, at large x value the 
distributions decrease by an order of magnitude from the lowest to the highest value, 
while at small x they increase by an order of magnitude. 



In the most recent version of PYTHIA (6.1), the CTEQ5 pdfs are available. 
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Figure 5: The up sea quark and gluon parton distributions from the CTEQl-4 sets plotted at 
a Q2 value of 10^ GeV^. 
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Figure 6: The up sea quark and gluon distributions from CTEQ4M shown at five different 
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4 Estimating Uncertainties 



In addition to having the best estimates for the values of the pdfs in a given kinematic 
range, it is also important to understand the allowed range of variation of the pdfs, i.e. 
their uncertainties. The crudest method of estimating parton distribution uncertainties is 
to compare different published parton distributions. This is unreliable since most published 
sets of parton distributions (for example from CTEQ and MRS) adopt similar assumptions 
and the differences between the sets do not fully explore the full range uncertainties that 
actually exist. Here and in the next section we concentrate on estimating the uncertainties 
due to to the limitations of available data sets. 

The sum of the quark distributions S(g(a;) +q{x)) is, in general, well-determined over 
a wide range of x and Q^. As stated above, the quark distributions are predominantly 
determined by the DIS and DY data sets which have large statistics, and systematic errors 
in the few percent range (±3% for 10~^ < x < 0.75). Thus the sum of the quark distributions 
is basically known to a similar accuracy. The individual quark flavors, though, may have 
a greater uncertainty than the sum. This can be important, for example, in predicting 
distributions that depend on specific quark flavors, like the W asymmetry distribution 
and the W rapidity distribution. 

Information on the d and u distributions comes, at small x, from HERA and at medium 



X from fixed target DY production on H2 and D2 targets. It is now well-established |T^, |19 
that the d and u distributions are not the same. The difference in these distributions between 
the CTEQ4M and CTEQ5M pdfs is due primarily to the influence of the data from the E866 
experiment. It is worth noting that our detailed knowledge of d/u is limited primarily to 
the X region (.03-. 35) covered by E866. 

The strange quark sea is determined from dimuon production in u DIS (CCFR[^), with 
the strange quark distribution (s + s) being approximately ^(u + d). The charm and bottom 
quark distributions are calculated perturbatively from gluon splitting for given masses of 
and rrib. (See also the previous discussion on schemes.) 

Current information on d/u at large x comes from fixed target DY production on H2 
and D2 and the lepton asymmetry in W production at the Tevatron. In the CTEQ5 and 
MRST fits, the NMC D2/H2 data are used to constrain the large x d quark distribution in 
this way. Bodek and Yang have argued that the D2 data need to be corrected for nuclear 
binding effects, which would lead to a larger d/u ratio at large x (and thus a larger d quark 
distribution as the u quark distribution is well-determined from DIS) |21|. The need for the 



nuclear binding corrections is still an open question |22|. The larger d quark distribution 



would lead to an increase in the high Tevatron jet cross section of about 10%. A similar 
excess would be expected for high jet production at the LHC. 

The parton distribution with the greatest uncertainty is the gluon distribution, simply 
because it does not couple directly to an external probe. The LHC is essentially a gluon- 
gluon collider and many hadron-coUider signatures of physics both within and beyond that 
Standard Model involve gluons in the initial state. Thus, it is very important to estimate 
the theoretical uncertainty due to the uncertainty in the gluon distribution. 

The gluon distribution can be determined indirectly at low x by measuring the scaling 
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violations in the quark distributions (9^2/9 log Q^), but a direct measurement is necessary 
at moderate to high x. Direct photon production has long been regarded as potentially 
the most useful source of information on the gluon distribution with fixed target direct 
photon data, especially from the experiment WA70 |2^, being used in a number of global 
analyses. However, as will be discussed in the next section, there are a number of theoretical 
complications with the use of direct photon data. 

The momentum fraction of the proton carried by quarks is determined very well from 
DIS data; at a Qo value of 1.6 GeV, in the CTEQ4 analysis for example, the momentum 
fraction carried by quarks is 58% with an uncertainty of ±2%. Thus, the momentum fraction 
carried by gluons is 42% with a similar uncertainty. This constraint is important; if the gluon 
distribution increases in one x range, momentum conservation forces it to decrease in another 
X range. Thus, if the gluon flux in the x range from 0.01 to 0.3 were to decrease by 20%, 
the gluon flux would have to increase by a fairly dramatic amount in the other x ranges to 
compensate. For example, if this compensation were to come in the high x region, the gluon 
distribution would have to double. 

A simple way of estimating the uncertainty in the gluon distribution is to systematically 
vary the gluon parameters in a global analysis and then look for incompatibilities with the 
data sets that make up the global analysis database. This study has been carried out by 
CTEQ using only DIS and Drell-Yan data where the theoretical and experimental systematic 
errors are under good control Except at larger values of x{x > 0.2 — 0.3), the variation 
in the gluon distributions is less than 15% at low values of Q^, decreasing to less than 10% at 
high values: as noted earher, evolution is the great equalizer for parton distributions. Note 
that the DIS and DY datasets used in this analysis do not provide any strong constraints 
on the gluon distribution at high values of x. This study used the CTEQ4 value of (i.e. 
0.116). If as is varied in the range from 0.113 to 0.122, the gluon distribution varies by 3% 
for X < 0.15. 

In order to assess the range of predictions for hadronic cross sections, it is more important 
to know the uncertainties in the gluon-gluon and gluon-quark luminosity functions at the 
appropriate kinematic region (in r = X1X2 = s/s) rather than the uncertainties in the 
parton distributions themselves. Therefore it is useful to define the relevant integrated 
parton-parton luminosity functions: for example the gluon-gluon luminosity function can be 
defined as: 

= / —g{xM^)g{Tlx,Q^). (2) 

This quantity is directly proportional to the cross section for s-channel production of a single 
particle and it also gives a good estimate for more complicated production mechanisms. In 
Figure |^ is shown the range of allowed gluon-gluon luminosities (normalized to the CTEQ4M 
values) for the variations discussed above. Here, is taken to be rs, which naturally takes 
the dependence of the gluon distribution into account as one changes ^JT. The top region 
is for the LHC and the bottom is for the Tevatron. Above a -y/r value of 0.1, the allowed 
variation grows dramatically; this indicates the need for more information about the gluon 
distribution at large x than provided by the DIS and DY data sets used in this analysis. 
In analogy with the discussion of gluon-gluon luminosities, one can also study the gluon- 
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Figure 7: The ratio of gluon distributions consistent with the DIS and DY data sets to the gluon 
distributions from CTEQ4M. The gluon distribution from CTEQ4HJ is also shown for comparison, 
in the second figure are shown the corresponding allowable variations in the integrated gluon-gluon 
luminosity as a function of ^/r. 



Table 1: The parton-parton luminosity uncertainty as a function of y/r. 

•^r range gluon-gluon gluon-quark 



< 0.1 ±10% ±10% 

0.1 - 0.2 ±20% ±10% 

0.2 - 0.3 ±30% ±15% 

0.3 - 0.4 ±60% ±20% 



quark luminosity (again normalized to the CTEQ4M result). The uncertainties on the 
parton-parton luminosities, as a function of ^/T, are summarized in Table 1. Note that the 
region of production of a 100 — 140 GeV Higgs at the LHC lies in the region where the range 
of variation in the gg luminosity is ±10%. 



5 Direct Photons and Jets in Global Fits 
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5.1 Direct Photons 



As mentioned previously in this section and in Reference p5[, direct photon production has 



long been viewed as an ideal vehicle for measuring the gluon distribution in the proton. 
The quark-gluon Compton scattering subprocess {gq •yq) dominates photon production 
in all kinematic regions of pp scattering, as well as for low to moderate values of parton 
momentum fraction x in pp scattering. As described previously, the gluon distribution is 
relatively well constrained at low x{x < 0.1) by DIS and DY data, but less so at higher 
X. Consequently, fixed target direct photon data have been incorporated in several modern 
global parton distribution function analyses with the hope of providing a major constraint 
on the gluon distribution at moderate to high x. 

A pattern of systematic deviations of direct photon data from NLO predictions has been 



observed [PGj |27| , however, these being particularly striking for the E706 experiment. The 



origin of the deviations is still quite controversial. One possibility that has been suggested 



is that the deviations are due to the effects of soft gluon radiation, or kx p3 HI- This 



view, however, is not universally held; see, for example, the discussion in Reference P3[ and 



in Reference [0. The kx values needed to describe the data are too large to be viewed as 
purely 'intrinsic' or non-perturbative in origin. But, as discussed in Reference in the 
standard formalism for direct photon production there are no double-logs to be resummed. 
This is in contrast to double-arm observables such as Drell-Yan or diphoton production; since 
direct photon production is, by definition, a single-arm observable, there is no restriction 
of phase space for gluon emission, and thus no double logarithmic enhancement to the pt 
distribution. The only enhancement effects that survive arise from the purely 'intrinsic' kx 
present in the colliding hadrons. 

Nonetheless, there is generally a substantial amount of k^ that results from the emission 
of soft gluons in hard scattering processes. Direct evidence of this kx has long been evident in 
Drell-Yan, diphoton and heavy quark measurements. The values of < kx >/ parton for these 
processes vary from 1 GeV at fixed target energies to 3 — 4 GeV at the Tevatron Collider. 
The growth is approximately logarithmic with center of mass energy. (The value expected 
at the LHC for relatively low mass states (30 — 40 GeV) is in the range of 6.5 — 7.0 GeV.) 

Perturbative QCD corrections are insufficient to explain the size of the observed kx and 
fully resummed calculations are required to explain Drell-Yan, W/Z and diphoton distribu- 
tions ^0 . These resummed calculations qualitatively describe the growth of the < kx > 



with center-of-mass energy. Currently there is no rigorous kx-type resummation calculation 
available for single photon production, for the reasons cited above. In addition, this calcula- 
tion is quite challenging in that the final state parton takes part in soft gluon emission and in 
color exchange with initial state partons, in contrast with the Drell-Yan and diphoton cases. 
Also, the calculation is complicated by the fact that several overlapping power-suppressed 
corrections can contribute and, at high x, threshold effects are important. 

Nevertheless, there has been recent theoretical progress in single photon resummation |^T], 



32| , p3| , . In particular, in Reference ||3^ , a technique has been presented for simultaneously 
treating recoil and threshold corrections in single photon inclusive cross sections, working 
within the formalism of coUinear factorization. In the preliminary results, substantial en- 
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hancements have been observed, at moderate pt and x, from higher order perturbative and 
power-law non-perturbative corrections. This approach is still quite new and the efficacy of 
the formalism still has to be evaluated. 

There is an intuitive picture that describes the effects of this soft gluon radiation, both 
perturbative and non-perturbative, on the direct photon cross section. The presence of soft 
gluon radiation, or k^, can give a 'kick' in the photon direction. Due to the steeply falling 
cross sections, the kick can lead to the promotion of photons from lower px to higher values 
of Pt- The more steeply falling the cross section, the larger the resulting enhancement. Using 
this intuitive picture, the effects of soft gluon radiation can be approximated by a convolution 
of the NLO cross section with a Gaussian kx smearing function. The value of < /c^ > 
to be used for each kinematic regime should be taken directly from relevant experimental 
observables, given the lack of a rigorous formalism, rather than from a theoretical prediction. 
The behaviour of the kx smearing correction is quite different for the Tevatron collider and for 
fixed target experiments. For the Tevatron, there are two points to note: (1) the agreement 
with the data is improved if the fc^ correction is taken into account and (2) the kx smearing 
effects fall off roughly as l/px [0- The latter behaviour is the expectation for such a power- 
suppressed type of effect and is the behaviour expected at the LHC, where the effects of the 
kx smearing should not be important beyond px values of 30 GeV 1^. 

The kx correction obtained for E706 at a center-of- mass energy of 31.6 GeV is shown 
in Figure |[ The value of < kx > of 1.2 GeV was obtained from measurements of several 



kinematic observables in the experiment ||2^. The kr smearing effect is much larger here 
then observed at the collider and does not have the 1/px falloff. Also shown are the kx 
corrrections using values of < fc^ > of 1-0 and 1.4 GeV (a reasonable estimate of the range 
of experimental uncertainty in the < kx > determination). In addition, the fcy correction 
for the E706 data used in the recent MRST pdfs is shown. The MRST kj- correction, 
utilizing a different model, is larger leading to a smaller gluon distribution in the relevant 
X range. (Both the CTEQ4 and MRST pdfs, with their respective kx corrections, lead to 
good agreement with the E706 direct photon cross sections.) The differences between the kx 
correction from Reference and that from the MRST pdfs can be taken as an indication 
of the uncertainty in the value of this correction. Good agreement with the E706 direct 
photon and cross section at ^/s = 31.6 GeV is observed when the nominal fcr correction 
of 1.2 GeV is used; however, the allowed range of variation of (/c^) (1.0 — 1.4 GeV) makes 
quantitative comparisons, and thus an extraction of the gluon distribution, difficult Since 
the high px E706 data agrees well with CTEQ4M, it would thus disfavor the CTEQ4HJ pdf. 
As stated before, however, a definitive conclusion must await a more rigorous theoretical 
treatment. 

Other related fixed target processes, such as vr'' production, in the same pt range as the 



-"^^Similar kr smearing effects should be present in all hard scattering cross sections, for example jet 
production at the Tevatron. The size of the experimental and theoretical systematic errors in the low Et 
region make such a confirmation difficult. 

^^NLO QCD predictions for fixed-target direct photon production (as is also true for other fixed target 
processes) also contain a non-negligible renormalization and factorization scale dependence, as discussed in 
Reference p5| 
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Figure 8: The variation of kr enhancements (ratio of cross sections with and without the kr 
corrections) relevant to E706 direct photon data at 31.6 GeV, for different values of average kx- 
In addition, the kx correction for E706 used in the recent MRST fit is indicated. 



measured direct photon cross section, may perhaps shed some hght on the puzzle. It has 
been noted ||35[ that essentially all of the fixed target vr^ cross sections disagree with NLO 
predictions, by essentially a constant factor. Thus, there may be a common problem causing 
the deviations, such as uncertainties in the high z quark and gluon fragmentation functions 
and possible sizeable higher order corrections. In addition, the importance of the high z 
fragmentation region implies the need for threshold resummation techniques to be applied, 
in processes with non-trivial color flow. 

It is worthwhile pointing out, though, that the same kx model used for for single photon 
production was shown to also provide an adequate description of the experimental 7r° cross 
sections pp| , |5B| . As in the case of direct photon production, the controversy regarding the 
theory/data discrepancies is still open. The vr^ cross sections may form a crucial role in the 
ultimate understanding for a number of reasons: if fcy are important for photon production, 
they should also have a measureable impact on the vr" cross sections as well. In addition, 
tt'^'s form the primary experimental background to direct photon production. 

Finally, it is not clear if any theoretical treatment for photon production is capable of 
describing all of the current fixed target direct photon data. There are discrepancies between 
the different experiments which may imply experimental difficulties, which are in addition 
to any of the theoretical problems discussed above. 
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5.2 Influence of Jets 



An important process that is sensitive to the gluon distribution is jet production in hadron- 
hadron colhsions. Processes responsible for jet production include gluon-gluon, gluon-quark 
and quark-quark (or anti-quark) scattering. Precise data on jet production at the Fermilab 
Tevatron are now available over a wide range of transverse energy, and the theoretical un- 
certainties in most of this range are well-understood. Thus, it is to be expected that jet 
production can provide a good constraint on the gluon distribution. 

The jet data that has been utilized in global pdf fits has been from the CDF and DO 
collaborations The data cover a wide kinematic range {Et values from 15 to 450 GeV 
corresponding to an x range of 0.02 to 0.5). The CDF jet data from Run lA were utilized 



in the CTEQ4HJ pdf fit ^7j. Here, a large emphasis was given to the high Ej- data points 
which show a deviation from NLO QCD predictions with "conventional" pdfs. Given the lack 
of constraints on the high x gluon distribution discussed in Section VI, the extra emphasis 
on the high Et region was enough to cause a significant increase in the gluon distribution; 
for example, the gluon distribution at an x value of 0.5 {Q = 100 GeV) increases by a factor 
of two. Since the dominant jet subprocess in this region is qq scattering the increase in the 
gluon distribution of a factor of two causes only a 20% increase in the jet cross section. This 
is sufficient to pass through the bottom of the CDF high Et jet error bars. The preliminary 
jet cross sections from Run IB (90 pb"^) from both the CDF and DO experiments were used 
in the CTEQ4M fits, but with statistical errors only and only for Et in the range 50 — 200 
GeV. The points with Et lower than 50 GeV have substantial systematic errors on both the 
theoretical and experimental sides while the points with Et higher than 200 GeV contain 
the CDF excess. The inclusion of the jet data serves to considerably constrain the gluon 
distribution over the x range of 0.1 to 0.2. The resulting gluon (CTEQ4M) does not decrease 
the excess observed by CDF at high Et- 

The published DO jet cross section |^ along with the (soon-to-be published) CDF jet 
cross section |^| from Run IB were used in the recently released CTEQ5 parton distribu- 
tions. The fits use the full Et range for the cross sections and use the correlation information 
on the systematic errors as contained in the covariance matrices for both experiments. The 
two experiments are in agreement with each other except for a slight normalization shift 
the two highest Et data points for CDF are above those for DO, but both experiments have 
large statistical errors in this region. As can be seen in Figure § the NLO QCD predic- 
tion with the CTEQ5M pdf is in good agreement with the CDF data. The conclusions are 
exactly the same for the DO jet data. The CTEQ5M gluon is very similar to CTEQ4M, 
except perhaps at very high x. The CTEQ4HJ pdf has been updated to complement the 
new CTEQ5M pdf. The CTEQ5HJ pdf gives almost as good a global fit as CTEQ5M to the 
full set of data on DIS and DY processes, and has the feature that the gluon distribution 
is significantly enhanced in the high x region, resulting in improved agreement with the 

^^The experimental and theoretical errors associated with the UA2 jet cross section make its use in pdf 
fits difficult. 

A shift on the order of 3% is expected since the two experiments use values for the total inelastic cross 
section that differ by that amount. 
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Figure 9: A comparison of the Run IB CDF inclusive jet cross section to the CTEQ5 fits. The 
bottom plot shows the measured cross section multiplied by in order to allow a linear display. 
The top plot shows the ratio of the measured cross section to that calculated with CTEQ5M, as 
well as the ratios of CTEQ5HJ to CTEQ5M. 

observed trend of jet data at high Et in both the CDF and DO experiments. 



6 Systematic Uncertainties 

There is currently an increasing awareness of the need and possibihty of propagating errors 
in the data into error estimates on parton distribution functions Ideally, one 

might hope to perform a full error analysis and provide correlated errors for all the parton 
distributions determined in a global fit. This goal is difficult to carry out for several reasons. 
Firstly, there is no established way of quantifying the theoretical uncertainties for the diverse 
physical processes that are used. More pragmatically, only a subset of the experiments 
usually involved in global analyses provide correlation information on their data sets in 
a way suitable for the analysis. In these circumstances, comparing data from different 
experiments becomes very difficult. Furthermore the standard fitting procedure introduces 
methodological uncertainties due in particular to the necessity of choosing specific choices 
of parametrization. All of these uncertainties are of course all highly correlated. We discuss 
each in turn. 



6.1 Theoretical Uncertainties 



The most important theoretical uncertainty in the determination of parton densities is the 
truncation of the resummed perturbation series at NLO. Consistent NNLO determinations 
will require NNLO splitting functions: there has recently been some progress in this di- 
rection [^, and it is hoped that NNLO calculations might be available before the LHC 
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is turned on. Meanwhile there are some 'approximate NNLO' calculations which at- 
tempt to reconstruct the NNLO splitting functions from their known integer moments and 
behaviour at large and small x: these analyses suggest that NNLO corrections might reduce 
theoretical uncertainties due to truncation of the perturbative expansion by at least a factor 
of two. 

One of the most important consequences of the theoretical uncertainty from unknown 
NNLO corrections is that it currently limits the accuracy of most of the experimentally more 
reliable determinations of a^. This in turn inevitably limits the accuracy of all extrapolations 
from low to high Q^: for example one of the largest uncertainties in the prediction of the W 



and Z cross-sections is that due to the uncertainty in as ^\ ■ 

Uncertainties at low due to higher twist may be estimated from phenomenological 
fits: recent studies [pT| , |48|] have shown that there are important correlations between em- 
pirical higher twist and the value of a^. It has also been shown that the fitted higher twist 
contribution drops when estimates of NNLO corrections are included |^^. The empirical 
higher twist is qualitatively consistent with renormalon estimates. Taken together, these 
observations suggest that it is difficult to disentangle genuine higher twist from higher order 
perturbative corrections: the true higher twist contribution may be much smaller than is 
suggested by the fits. 

The correct treatment of heavy quarks close to threshold was developed some time 
ago [l^; more recently it was proven that this procedure works to all orders in pertur- 
bation theory |]50|. This treatment is now included in some of the CTEQ fits |1|, a 
closely related but not identical procedure is used by MRS 0. A simpler version of ACOT, 
which nonetheless accurately reproduces its essential features, has also been developed [p^. 

An accurate treatment of heavy quark production, and indeed W and Higgs production, 
requires the resummation of threshold logarithms. Recently it has been suggested that 
resummation of soft gluons may solve some of the problems with prompt photons ||31 



33| , [3^ . A fully consistent treatment will require the inclusion of soft gluon resummations in 



parton determinations, but as yet this has not been attempted. Renormalon studies suggest 
that such resummations may substantially improve the reliability of perturbation theory at 
large x. Again there will be strong correlations with higher twist. It would be particularly 
interesting to see the effect of such resummations on the predictions for the parton-parton 
luminosities eq.^ in the region relevant for Higgs production at the LHC 

The resummation of high energy (small x) logarithms is more problematic. Present data 
suggest that their effect on inclusive cross sections must be very small, at least at HERA and 
the Tevatron if not at the LHC. Furthermore, conventional theoretical approaches |53, ^ 
based on summations of LLx and NLLx \EM corrections have been shown to break down: the 



NLLx corrections are overwhelmingly large and negative |^. Various suggestions for the 
resummation of these large corrections have been put forward |5^, ^ Hopefully a 
detailed phenomenological analysis based on one or other of these procedures will eventually 
provide a reliable estimate of the error due to uncertainties in small x evolution when using 
parton distributions measured at HERA to predict those to be used at the LHC. 
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6.2 Combining Different Experiments 

On the experimental side, one of the major problems with combining results from different 
experiments lies in the degree of 'rigour' in the interpretation of the experimental errors. 
Experimental results may be conveniently expressed as probabilities P(data|theory), i.e. 
the probabilities of obtaining the given set of data given a certain theoretical prediction 
P^ . Often these probabilities are expressed in terms of predictions and (Gaussian) errors: 
for a given experiment, P{d\t) = exp(— where d are the data, t the theoretical 
predictions and 

x'{d\t) = Y.id-t)j:-\d-t) (3) 

data 

where S is the matrix of correlated errors. Maximizing the probability, and thus obtaining 
the most likely 'prediction', then corresponds to minimizing the x^- K should be emphasized 
that it is not necessary to present experimental results in this way, and in particular some 
systematics may be completely non-Gaussian; however if the experiment is to be useful it 
must always provide a (clear) estimate of P{d\t), otherwise the error analysis is at best 
incomplete and at worst useless. 

In the present situation, the predictions will be constrained functionals of the input 
pdfs (the constraints being the result of perturbative evolution and cross-sections). If the 
errors have been estimated correctly, and the theory which constrains the predictions is 
sufficiently accurate, then there should be pdfs for which the per degree of freedom is 
of order unity. Unfortunately for many important datasets this is not the case, and thus if 
one were to insist on the rigour of the statistical method, then many important experiments 
would not be included in the analysis [^2|, |7l| . Such a strict criterion is probably unrealistic: 



rather the emphasis should be placed on using the maximal experimental constraints from 
experimental data I^TJ. In this case the standard statistical techniques may not apply, but 
must be supplemented by physical considerations, taking into account experimental and 



theoretical limitations [43 



As an example of how this works in practice, we consider a recent CTEQ error analysis 
of the ly-production cross-section This uses the standard CTEQ5 analysis [0] as 

the starting point: there are fifteen experimental data sets, with a total of ~ 1300 data 
points, and experimental errors are generally treated by ignoring correlations and combining 
statistical and systematic errors in quadrature (so S in eq.(^ is taken to be diagonal, with 
each diagonal entry set to a'^^^^ + a'^^^^ of the corresponding data point). The initial pdfs 
are parameterised by 18 parameters ai,i = 1, . . . , 18: each theoretical prediction is then a 
function of these parameters. The 'best-fit' distribution (CTEQ5M1 in this case) is then 
given by the set of parameters a which minimise J2exptsJ2da.taX'^{d\t[f{a)]), where t[/(a)] 
are the theoretical predictions for each data point given the pdf /(a) for the fifteen base 
experimental data sets. 

A natural way to find the limits of a physical observable which depends on the pdfs, call 
it 0[/(a)], such as the VT-production cross-section aw at y/s = 1.8 TeV, is then to study the 
dependence of the total on O. An efficient way of doing this is to use Lagrange's method 
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W-production X-sec. at the Tevatron 



W-production X-sec* Brigp at the LHC 
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Figure 10: of the base experimental data sets vs. the W production cross-section at the 
Tevatron and LHC. 



of undetermined multipliers: one minimizes 



^(A)= E Y.x'mfm + XO[f{a)] 

expts data 



(4) 



for fixed A, and then varies A in order to map out the as a function of O. 

Figs. p!0|a,b show the for fifteen base experimental data sets as a function of aw 
at the Tevatron and LHC energies respectively [^]. Two curves with points corresponding 
to specific global fits are included in each plot[^ one obtained with all experimental nor- 
malizations fixed; the other with these included as fitting parameters (with the appropriate 
experimental errors). We see that the x^'s for the best fits corresponding to various values 
of the W cross-section are close to being parabolic, as expected. Indicated on the plots are 
3% and 5% ranges for aw The two curves for the Tevatron case are farther apart than 
for LHC, reflecting the fact that the W-production cross-section is more sensitive to the 
quark/ anti- quark distributions and these are tightly constrained by existing DIS data. 

The important question is: how large an increase in should be taken to define the likely 
range of uncertainty in 07 The elementary statistical theorem that A^^ = 1 corresponds 
to one standard deviation of the measured quantity O relies on assuming that the errors 
are gaussian, uncorrelated, and with their magnitudes correctly estimated. Because these 
conditions do not hold here, this theorem cannot be naively applied quantitatively: rather 



one must examine in detail how well the fits along the parabolas shown in Fig.|TO| compare 
with the individual precision experiments included in the global analysis, in order to arrive 
at reasonable quantitative estimates on the uncertainty range for the W cross-section. In the 
meantime, based on past (admittedly subjective) experience with global fits, it seems that 
a difference of 40 — 50 points represents a 'reasonable' estimate of current uncertainty of 
parton distributions. This implies that the uncertainty of aw is about 3% at the Tevatron, 
and 5% at the LHC. 



19n 



'The third Une in Figs. |l0| a refers to an alternative technique |42 based on the assumption of Gaussian 
errors in the parameters Ui. 
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6.3 Correlated Experimental Systematics 



There is now an increasing awareness of the necessity and possibihty of carrying out a careful 
treatment of correlated systematic errors when attempting to determine errors on pdfs. For 
example a systematic study of the uncertainties in the parton distribution in the small x 



region has been made recently by experimentalists at HI and ZEUS [62, n3|. These studies 



include a proper treatment of correlated systematic errors, and some attempt is made to 
quantify parametrization uncertainties. Similar studies of the errors in polarized parton 
densities have been made by the SMC |6^. Besides showing that careful estimates of parton 
uncertainties are useful and necessary, these studies also show that it is possible to include 
correlated systematics and combine data sets from different (albeit similar) experiments 
in a meaningful way. However they also show that doing something similar for a global 
parton determination would be very difficult and extremely tedious, unless new techniques 
are developed. 

The importance of correlations in experimental systematic errors has been underlined by 
a recent reanalysis |^5[ of the F2 BCDMS data. A more careful treatment of the correla- 
tions between data taken at different beam energies, and the correlations between the fitted 
parton distributions and higher twist, results in a significant increase in the value of as(M|) 
extracted from the data: Alekhin quotes a value of 0.118 ± 0.002. This is consistent with 
the current world average and the value 0.119 ±0.002 recently extracted from the reanalysed 
CCFR data [^] (though after a more careful treatment of correlated higher twist this 
rises to 0.122 ± 0.005). 

In this context it should be noted that in the usual global analyses, in which correlations 
between systematic errors are ignored, and higher twist effects are not included, neither the 
BCDMS or the CCFR F2 data show a minimum in their ^is is varied p|, |T^, despite 
the fact that when treated separately each is capable of yielding an excellent determination 
of as- Only the minima in the HI and ZEUS datasets are strong enough to survive this 
treatment: this may be helped by the fact that empirical higher twists are very small at 
small X It will be interesting to repeat the preliminary determination |^ using the 



95-97 HERA datasets when these finally become available. 



6.4 Methodological Issues 

While the issues addressed in the previous three sections are no doubt all important, there 
are also some methodological issues which need to be considered if we are to achieve our aim 
of a reliable determination of the errors in a global determination of parton distributions. 
In particular, we need a technique which can give parton distribution functions and their 
errors, such that: 

(i) there is no inbuilt methodological bias (for example dependence on a particular 
parametrization of the input distributions) 

(ii) it is easy to propagate the effects of correlated systematic errors in the data to 
correlated uncertainties in the parton distributions 

(iii) it is easy to add new data sets or estimate theoretical errors or test models of new 
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physics without redoing the whole of the analysis. 

All of these criteria can be met if we 'quantise' our parton distributions: instead of trying 
to determine a single 'best fit' set of parameterised parton distributions with an associated 
error matrix, we construct an ensemble of sets of partons, distributed according to how well 



they fit the data [H2], ITO, IT^, |73|, WM . The expected result for a parton dependent observable, 



call it 0[f], would then be given by an ensemble average: 

{0[f]) = J [Vf] 0[f] J[f] s[f] n P{d\m, (5) 

expts 

where J['Df] means functional integration over all possible input distributions / (subject to 
basic constraints such as sum rules and positivity) and Z = (1) is a normalization factor. The 
measure of integration is given essentially by the probability distributions P{d\t[f]) for each 
of the experiments used as input. These probabilities are, as explained above, the essential 
input of the experimental data used in the fit: they support distributions which fit the data 
well, and suppress the contribution of distributions which fit badly. If the errors on the data 
were assumed Gaussian, these probabilities would come in the form of a x^y i^i 6q.@, 
though the technique does not depend on such an assumption, and non Gaussian errors 
could also be incorporated. There is also a Jacobian factor J[f], which turns the integration 
measure from an integration over theoretical predictions t[f] to one over the pdfs themselves, 
and enforces the theoretical constraint that the theoretical predictions are related through 
pQCD. It is also necessary to introduce a 'smoothness' factor s[f] into the measure, to 
enforce the natural theoretical prejudice that the initial pdfs should be smooth functions of 
X, without wiggles or jumps: a suitable form for such a factor would be exp —^£j2x{dxf)'^, 
where e is a small parameter which quantifies the extent of this prejudice. Final results 
should be independent of the form of this term, and in particular the parameter e, provided 
that it is varied in a suitable range. 

The way in which this procedure works should now be clear, since it is similar to the 
quantum mechanics (or more precisely statistical mechanics) of a particle in a (highly non- 
local) potential |7^: the parton distributions may be thought of as quantum fields, with, in 
the case of Gaussian experimental errors, the action 

m = lT.T.id-t[m-\d-m + leY.{dJ?. (6) 

expts data a; 

The best fit parton distribution is then the solution of the classical equations of motion (since 
it minimises the action), while the error bands are given by the 'quantum' fluctuations around 
the classical field. Since the determination of the classical field is itself nontrivial, the system 
is best solved numerically: we discretise the field by introducing a parametrization with a 
finite number of parameters ai, i = 1, N, so that JlVf] J[f] — > Hi J{ai), rather as we 
would for a lattice field theory. Here the best discretization would not necessarily be a naive 
discretization in xbj with spline interpolation: rather it might involve expansion of each pdf 
in sets of orthogonal polynomials, or other sets of (orthogonal) functions, for example eq. (|I|) 
and its obvious generalizations. The integration over the parameters a would then be done by 



Monte Carlo, using an algorithm such as Metropolis or HMC to generate an ensemble 
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of configurations distributed according to the measure of integration, and thus according 
to its hkehhood given the input datasets. Finding each such configuration will involve a 
similar computational effort to that of finding a best fit. Finally, we would like to increase 
the number of parameters N (taking the 'continuum limit') until we are sufficiently close 
to a truly parametrization independent ensemble, at which stage we can readily compute 
expectation values of observables and their associated errors as averages over the ensemble 
of pdfs. 

This procedure has several advantages: 

(i) it is intrinsically parametrization independent as the number of parameters increases, 
because of the universality of the continuum limit. Flat directions are no longer the problem 
that they are in a best fit procedure: the total number of parameters is now limited only by 
computational resources. Indeed the fiat directions are now interesting, since they give the 
most important uncertainties in the parton distribution functions. 

(ii) the propagation of correlated systematics is automatically taken care of by the pro- 
cedure. The only limitation is the reliability of the probabilities P{d\t) produced by ex- 
perimentalists. This should give added impetus to the determination of meaningful (and 
thus comparable) estimates of systematic errors by different experimental collaborations, 
and their presentation in such a way that they can be readily input into such an analysis. 
Preliminary explorations of the technique [^] indicate that the errors in the pdf parameters 
are not only highly correlated, but also in many cases significantly non-Gaussian, even when 
the errors in the data are assumed to be Gaussian. 

(iii) Data from new experiments can be added using the old configurations, since dif- 
ferent experiments are (in principle!) statistically independent, so iStot[/] = Z^expts '^exp [/] • 
Similarly we could estimate theoretical errors due, for example, to NLO truncation, by using 
the standard configurations reweighted by varying renormalization and factorization scales. 
Similarly, we could test the effect of resummations by reweighting the configurations gener- 
ated using standard NLO evolution, or indeed test models for new physics by reweighting 
the configurations generated using the Standard Model [fflH . 

The main problems to be faced in actually implementing the procedure are computa- 
tional: we need a fast evolution code, and high performance computing. The advantages of 
parallelization should be obvious. In fact the computational requirements are very similar 
to those of the lattice gauge theorists: calculating the 'action' is more difficult, but the 
'continuum limit' should be reached much more quickly. 



7 From here to the LHC and Beyond 
7.1 Progress Before the LHC Turns on 

Perturbative QCD has been extremely successful in describing data in DIS, DY and jet 
production, as well as describing the evolution of parton distributions over a wide range of 
X and Q^. From the point of view of pdf determination, the primary problem lies in the 
calculation of the direct photon cross sections which could serve as a primary probe of the 
gluon distribution at high x. However, a rigorous theoretical treatment of soft gluon effects 
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(perhaps requiring both kx and Sudakov resummation) will be required before the data can 
be used with confidence in pdf fits. On the experimental side, it will also be necessary to 
resolve the inconsistency between the WA70 and E706 data. 

DO has recently presented a new result for the measurement of the inclusive jet cross 
section as a function of the jet rapidity (up to values of three) |7^. Such a measurement 
probes a greater kinematic range than the central inclusive jet cross sections. In addition, the 
differential dijet data from the Tevatron explore a wider kinematic range than the inclusive 
jet cross section. Both CDF and DO have dijet cross section measurements from Run I 
which may also serve probe the high x gluon distribution, in regions where new physics is 
not expected but where any parton distribution shifts should be observable. The ability to 
perform such cross-checks is essential. 

CDF and DO will accumulate on the order of 2-4 fb''^ of data in Run II (from 2000-2003), 
a factor of 20-40 greater than the current sample. This sample should allow for more detailed 
information on parton distributions to be extracted from direct photon and DY data, as well 
as from jet production. Run III (2003-2007) offers a data sample potentially as large as 30 
fb-\ 

HI and ZEUS will continue the analysis of the data taken with positrons in 1991-97. 
HERA switched to electron running in 1998 and plans to deliver approximately 60 In 2000, 
the HERA machine will be upgraded for high luminosity running, with yearly rates of 150 
integrated luminosity of about 1 fb~^ by 2005. This will allow for an error of a few percent 
on the structure function F2 for scales up to 10'^ GeV^. The gluon density, derived from 
scaling violations of F2, should be known to an accuracy of less than 3% in the kinematic 
range 10~^ < x < 10~^. 

It is also hoped that over the next five years the Monte Carlo outlined in the previous sec- 
tion will begin to bear fruit, perhaps to the point where they can make a serious contribution 
to global pdf error analysis. 



7.2 Physics cross sections at the LHC and the role of LHC data 
in pdf determination 

ATLAS measurements of DY (including W and Z), direct photon, jet and top production will 
be extremely useful in determining pdfs relevant for the LHC. The data can be input to the 
global fitting programs, where it will serve to confirm/ constrain the pdfs in the LHC range. 
Again, DY production will provide information on the quark (and anti-quark) distributions 
while direct photon, jet and top production will provide, in addition, information on the 
gluon distribution. 

Other processes might also prove useful. For example diphoton production might be 
useful for determining the gluon distribution, and this in turn would lead to an improved 
knowledge of the relevant parton pdfs and parton-parton luminosity functions for the produc- 
tion of the Higgs (which is largely due to gg scattering for low to moderate Higgs' masses). 

Another possibility that has been suggested is to directly determine parton-parton lumi- 
nosities (and not the parton distributions per se) by measuring well-known processes such 
as W/Z production ||4^. This technique would not only determine the product of parton 
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distributions in the relevant kinematic range but would also eliminate the difficult measure- 
ment of the proton-proton luminosity. It may be more pragmatic, though, to continue to 
separate out the measurements of parton pdfs (through global analyses which may contain 
LHC data) and of the proton-proton luminosity. The measurement of the latter quantity 
can be pegged to well-known cross sections, such as that of the W/Z, as has been suggested 
for the Tevatron. 

8 Conclusions 

The determination of parton distributions and uncertainties is an important ingredient of our 
preparations for physics at the LHC. The global fitting techniques used for the past fifteen 
years may soon be superseded by more sophisticated methods. Developing and exploiting 
these techniques will be a great challenge to theorists and experimentalists alike. 
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Generalized factorization and resummation 



C. Balazs, J.C. Collins, D.E. Soper 



Abstract 

In this section we summarize the formahsm which extends the usual hadronic fac- 
torization theorem to the low transverse momentum region for the inclusive production 
of colorless final states, while resumming logarithms with the ratio of the invariant mass 
and transverse momentum. Among the various recent applications the calculation of 
the and Higgs boson transverse momentum distributions are highlighted. 

1 The Collins-Soper-Sterman formalism 

The standard factorization formula fails near kinematic boundaries. We discuss the case 
of low transverse momentum in Zq production, etc; this is an important case because the 
cross section peaks there. The failure of the factorization formula is symptomized by large 
corrections involving a factor of In^ Q/Qt for each power of a^. 

Although the solutions to the problem are all commonly referred to as "resummations" , 
there are in fact two very different approaches. One is resummation in its strict sense: One 
performs a selective and approximation summation of the largest parts of the perturbative 
series for the hard scattering in the standard factorization formalism. 

The second approach is that of Collins, Soper and Sterman (CSS) ^. These authors 
observed that the conventional factorization formalism is in fact wrong at low transverse 
momentum and they derive a correct factorization for this region. In an intermediate region 
of transverse momentum, the standard factorization with resummation is applicable with 
somewhat reduced accuracy, and there is an overlap between the two approaches, which we 
will discuss later. 

In any case, it is essential to improve on the standard fixed-order factorization formalism, 
and the reward is an improved method that 

• includes large, logarithmic QCD corrections up to all orders in the strong coupling, 

• improves the renormalization scale dependence of the prediction, 

• enables prediction of certain quantities reliably, which cannot be done in a fixed order 
calculation, 

• provides an independent, analytic check for parton shower Monte Carlo's. 
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1.1 /cy-dependent parton densities 



CSS realized that the failure of the standard factorization when Qt ^ Q occurs because it 
neglects the transverse motion of the incoming partons in the hard scattering. (Here Q can 
be the invariant mass of a colorless particle, or set of particles, created in a hard partonic 
collision, and Qt is the related transverse momentum.) The approximation of neglecting 
parton transverse momentum is only valid when the cross section is integrated over a large 
range of Qt- But if, for example, Qt is of order 1 GeV, then we are outside of the domain 
in which the factorization is applicable. 

A fully satisfactory approach must use a factorization theorem that is valid for any Qt 
that is small compared to Q. CSS's theorem gives the cross section as a convolution of 
transverse momentum distributions 



where P is a partonic density distribution that is a function of both longitudinal (x) and 
transverse (fc^) momenta. The partonic recoil against soft gluons as well as the intrinsic 
partonic transverse momentum are included in P. 

Such a treatment completely formalizes the intuitive notion that partons must have trans- 
verse momentum and that this transverse momentum gives rise to transverse momentum of 
the Drell-Yan pair. There is then no need to convolute a calculated cross section with "in- 
trinsic transverse momentum" for the quarks; this manoeuvre is only necessary as an ad hoc 
correction to a formalism that is incomplete. 

In QCD, complications arise from soft-gluon effects, because these effects do not cancel, 
in contrast to the case of the cross section integrated over Qt- A consequence, proved by 
CSS, is a particular form of the evolution equations for the /cT-dependent parton densities. 
These equations are not the normal DGLAP equations^. The kernel of the evolution contains 
a perturbatively calculable part and non-perturbative part. The non-perturbative part can 
be summarized by saying that there is a fixed amount of gluon radiation per unit rapidity, 
so that the transverse momentum distribution of the partons broadens in a characteristic 
way with energy. The non-perturbative part of this energy-dependent radiation is fitted by 
the (?2 term of Eq.(^) below. 

This feature may be the dominant reason why transverse momentum distributions are 
so broad at high energies, as in Z° production: the transverse momentum of the has 
a component due to the recoil against non-perturbative glue emitted into many units of 
rapidity. 

The CSS formalism clearly entails a phenomenological fitting of the non-perturbative part 
of the /cT-dependent parton densities and the evolution kernel. In principle, this can be done 
at fairly low energy, and then the evolution equations predict the results for higher energies 
with no further adjustable parameters. The more conventional resummation formalism is 
compatible with the CSS formalism, but it is not as complete. 

^ Although all the physics associated with the DGLAP equations is present. 
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Figure 1: Transverse momentum distribution of electron-positron pairs from decays of (mostly) 
bosons, produced at the Tevatron in = 1.8 GeV center of mass proton-anti-proton 
collisions. The data are CDF preliminary [Q], and the curve is calculated by the ResBos Monte 
Carlo event generator H]. 
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Because the CSS formalism is designed to treat correctly the Qt ^ Q region, it also 
provides an appropriate resummation of the large logarithms, hi{Q/QT) in the standard 
factorization formula. 

We can gauge how important these logarithms are in practice by examining the cross 
section for Z production at the Tevatron. The bulk of the cross section is in the low Qt 
region, and, as can be seen from Fig. |l|, there is a peak at around Qt = 2.7 GeV, which 
is much smaller than the invariant mass Q = mz = 91.187 GeV. This implies that for the 
bulk of the events ln(Q/Qr) is large enough that as{mz) \n.^{Q/QT) > 1- Since we have a 
double logarithm for each radiated gluon, higher orders in the perturbative series are not 
suppressed. 



1.2 From fixed order to resummed 

In this section we show how the results of the standard factorization theorem are related to 
a resummation in terms of leading logarithms, etc. 

When the is produced in a hadron-hadron collision its transverse momentum is bal- 
anced by some hadronic activity which stems from partons emitted by the initial state 
partons. (In the first order in the strong coupling a Z° and a gluon is produced.) The Qt 
distribution given by the usual factorization in the low Qt region is written as 

where the coefficients „f ^ are perturbatively calculable apart from some factors of par- 
ton densities. When the two scales Q and Qt are very different, the logarithmic terms 
\n"^{Q'^/Q^) are large, and for Qt <^ Q the perturbative series is dominated by these terms. 
For Qt ^ Q truncation of the perturbative series, i.e. any fixed order calculation, gives an 
answer which neglects these important all order logarithmic contributions. At the lowest 
order, the Z° boson is produced alone, that is with a Qt distribution of S{Qt)- The 

singularity at Qt = prevails at any fixed order in as, as Eq. (||) shows. 

One way of reorganizing the perturbation series is to make the expansion one in terms 
of ttslyi^iQ /Qt) instead of itself. In this simplified picture, calculating fixed order QCD 
corrections means calculating the perturbative series 

lim -—T- = 

Qt^O dQT 

Qt^ {as{iv[L + iv'o) + a%v'sL^ + iV^L^) + a%v'^L^ + ^^v'^L^) + ... 

+ a%v[L^ + 2v',L^) + a%v'^L^ + :,v'^L^) + ... 

+ ... ...}, 

column by column. In the leading logarithm approach, on the other hand, we calculate the 
above series line by line 0. While in the fixed order (column by column) calculation the 
convergence for low Qt is spoiled by the higher order uncalculated logs (L = \n.{Q /Qt)), in 
the resummed (line by line) calculation convergence is preserved in each "order" (by each 
line), and higher order corrections are systematically included. 
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1.3 The CSS formula 



The improved factorization theorem of CSS together with their evolution equation for the 
dependent parton distributions, leads to a useful formula^ for the cross section. For 
production it can be written as 

^^^dCP^dQ"^ ^dy^^ " ^ ^0jW^jj(Q, <5t, a^i, X2) + Y{Q, Qt, Xi, Xs), (3) 

where the "resummed" part, W{Q,Qt, xi,X2), is defined as 

Wjj{Q,QT,xi,X2) = (4) 
_i_ J £be''^-'q/M,bx^,fi)e-^^Q'''%/,,iQ,b,X2,fi) 

for a given partonic initial state with flavor j.| The Fourier integral is introduced because 
transverse momentum conservation is explicit in the impact parameter, b, space 
All the dangerous logarithms are included in the perturbative Sudakov exponent 



A («,(/!)) In (^^^ +B («,(/!)) 



(5) 



Here Cq is an arbitrary parameter which cuts off the perturbative low Qt region.^ To 
prevent perturbative calculations from being done in region where perturbation theory is 
inapplicable, the "impact parameter" b in the Sudakov exponent was replaced by 

b. = ^ (6) 

The errors caused by this replacement are of the same form as the non-perturbative contri- 
butions to be discussed below, and are therefore correctly treated by being absorbed into 
the non-perturbative part of the formula. 

The A and B functions are free of large logarithms and can be reliably calculated per- 
turbatively for a given process as 

A{aM) = t[^jAi''\ B{aM) = tl^^jBi-\ (7) 



^ While solving the RGE, an integro-differential equation, specific choices of integration constants were 
made (c.f. Ref. §): Ci = C3 = 2e~'"^ = Cq and C2 = C4 = 1, to optimize logarithmic contributions. This 
is similar to the fi = Q choice in case of the ultraviolet renormalization, to make terms like hi{^/Q) vanish. 

•^The lowest order partonic total cross section is aoj = n^g'^{{l — 4(5jS^)^ — l)/(48Q^c^), where g is the 
weak coupling constant, (c^) is the sine (cosine) of the weak mixing angle squared, and Qj is the charge 
of the quark flavor j. 

practice Cq = 2e~'^'^ is used, which is related to the values of the integration constants of the RGE 
for the k-T dependent PDF's. 
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The distributions 

depend on virtual and real emission contributions for a given process, via the Wilson coeffi- 
cients Cja- Just as the A and B functions the Wilson coefficients are expanded in terms of 
the strong coupling 

Q,{K,z,^^) = ±(^ycl-\z,K). (9) 

n=0 \ ^ / 

Since the Sudakov exponent integrates to unity, the Cij function sets the normalization 
of the resummed distribution. In particular, if coefficients up to Cf""* are included in the 
calculation then the resummed rate will equal the rate calculated in fixed order at 0{a'^) 
0. The function fa/h{x,fJ') is the usual renormalized momentum fraction (x) distribution of 
parton a in hadron h at the energy scale /i. Observe that the impact parameter dependence 
of the perturbative coefficient functions is cut off by the use of 6* instead of b. 

Included in Eq. (||) are two non-perturbative factors, J-'a/h{b,x) and e"'''-'')^"*^. These 
implement the parts of the CSS factorization and evolution equation that cannot be imple- 
mented as a resummation of the standard factorization theorem. They also compensate for 
the errors in the resummation at large b. The overall effect is that (^) define /cT-dependent 
parton densities. The JF factor can be interpreted as allowing for intrinsic transverse momen- 
tum, and the e~'"(^)^°'5 factor allows for the recoil against soft gluon radiation. The InQ in 
the exponent of the soft-gluon factor comes from the solution of the CSS evolution equation 
and can be interpreted by saying that soft gluons are emitted uniformly in rapidity. 

The perturbative part of the formula uses 6* instead of b, as defined by Eq. (^. The 
parameter bma,x provides an infra-red cutoff on the perturbative part of the formula. In 
practice the empirically optimal value, femax = 1/2 GeV~^, is used. This arbitrary cutoff of 
the b integration is compensated by the parameterization of the non-perturbative part of the 
formula, which is 

(Q,6,a;i,X2) = ^./^,(Q, 6, a;i)^,/,,(Q, 6, X2)e-^(^)i°« (10) 



^3 

= exp 



-9ib^ - 92b'^ In " fl'ifl's&ln (IOOX1X2) 



where Qq is chosen to be the initial scale of the parton evolution^ and the Qi parameters 
have to be determined using experimental data.0 



1.4 Matching 

The resummed term, defined by Eq. (H), was derived in the context of a generalized fac- 
torization, under the assumption that Qt ^ Q- This assumption will break down within 

5 For recent CTEQ PDF's Qo = 1-6 GeV. 

^The ln((5^/Qo) term is introduced to match the logarithmic term of the Sudakov exponent and its 
coefficient is expected to be process independent, depending only on the initial partonic state. 
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and beyond the intermediate Qt ~ Q region. In the high Qt region (where Qt ~ Q) the 
conventional perturbative factorization formahsm is rehable. To obtain sufficiently accurate 
results for all Qt-, it is necessary to combine the formalisms. 

The Y term in Eq. @ was introduced by CSS to correct the behavior of the resummed 
piece in the intermediate and high Qt regions.[] It is defined as the difference of the cross 
section calculated from the standard factorization formula at a fixed order n of perturbation 
theory and the Qt <^ Q asymptote of this cross section: 

YiQ,Qr,.uX.) = (-^^^j^)^- (-^^5^)^^^^^^. (11) 
Thus, the full CSS formula can be written as 



da ( da \ ( da \ ( da 



■ (12) 

n,QT<.Q 



dQ'^dQ\dy \dQ^ dQldy ) \dQ^ dQ^^dy J ^ \dQ^ dQ^^dv 
This method of matching the resummed and fixed order pieces is valid because the low 



Qt asymptote used in Eq. (|TT]) is the same as the large Qt asymptote of the resummed term 
W. At low Qt the asymptotic part dominates the Qt distribution (the logs are large), and 
the last two terms cancel in Eq. ([T2|) , while the resummed term is significant near Qt = 0. At 
high Qt the logs are small, and the expansion of the resummed term cancels the Qt singular 
terms up to higher orders in Og.^ In this situation the ffist and third terms cancel and CSS 
formula reduces to the fixed order perturbative result. After matching the resummed and 
fixed order cross sections in such a "smooth" manner, it is expected that the normalization 
of the CSS cross section reproduces the fixed order total rate, since when expanded and 
integrated over Qt it deviates from the fixed order result only in small higher order terms 
in as 0. 

Unfortunately the above argument does not completely work in practice. The problem 
arises because at large Qt the W term in Eq. (|^) is an extrapolation of the cross section from 
small Qt- So it has a 1/Qt behavior, modified by logarithms. This falls less steeply than 
the true cross section, which is subject to kinematic limits. The errors in the CSS formula 
at large Qt are indeed suppressed by a power of ag. But the coefficient of this power is the 
1/Qt part of the formula, and so the error can be easily larger than the true cross section. 
A symptom of the problem is that the cross section calculated from Eq. (|^) is typically 
negative at large enough Qt- 

One possible remedy |^ is to abandon the CSS formalism. But we regard this as un- 
desirable, because it also abandons the important physical result of CSS that goes beyond 
mere resummation: their proper treatment of non-perturbative transverse momentum. 

A second, commonly used remedy, is to utilize the fact that in the high Qt region the 
fixed order result is a good description of the distribution. So when calculating the Qt 
distribution one can simply switch from the CSS to the fixed order distribution whenever 
they cross for high Qt's. Since the mismatch between the resummed and the asymptotic 

^The exact definition of the Y piece for production can be found in Refs. ^. 

^The cancellation is higher order than the order at which the singular pieces were calculated. 
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30 40 50 60 70 80 

QjiGeV) 

Figure 2: Transverse momentum distributions calculated using the CSS (solid and dashed) and 
the usual factorization (dot-dashed and dotted) formalisms. The CSS calculation is not switched 
over to the usual formalism to show the typical "kink" which occurs at a switching point. The 
solid and dot-dashed curves are calculated at 0{a'^), while the dashed and dotted curves are 
at 0{as), illustrating the improvement of the high Qt behavior of the CSS formula with the 
perturbative order. 
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CDF Preliminary 
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Figure 3: Same as Fig. |], except shown for the full range of transverse momentum Qt- Switching 
from the CSS to the usual factorization formalism in the 60 ~ Qt ~ 70 GeV region (c.f. [§]), 
results in a smooth Qt distribution which agrees well with the experiment in the full Qt range. 
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terms in Eq.(0) decreases as the perturbative order of the calculation (n) increases, it is 
expected that the crossing point shifts toward Qt = Q, and the slope of the resummed and 
fixed order curves approaches each other as n increases (cf. Ref. 0). Indeed, calculations 
at C(a^) blend closer to mz, and smoother than at 0{as), as shown in Fig. ^. 

When this prescription for the switching is followed at the fully differential, da/dQTdQdy, 
level the result is a smooth and differentiable Qt distribution, after the invariant mass and 
rapidity is integrated out. This is illustrated in Fig. It was shown in Ref. P] that the 
integral of the Qt distribution calculated using this prescription recovers the fixed order 
total rate within an error which is the size of the higher orders, as it is expected. 



1.5 Improved matching 



Since the calculations of W and Y are done using truncations of perturbation theory, the 
switching between calculational methods introduces an artificial discontinuity in the slope 
of the cross section. This practical problem arises in the matching because of a mismatch 
of the orders of perturbation theory at which W and Y are calculated. From the point of 
view of a standard factorization calculation, W contains a selective summation of arbitrarily 
high orders of perturbation theory. The possibility of getting such a resummation relies 
on performing certain approximations that are only valid at small Qt- The difficulty of 
performing complete higher-order calculation means that Y can only be calculated at fixed 
order. 

At large transverse momentum, is much larger than the actual cross section, and so 
the cross section Eq. (|T2|) is obtained by the cancellation of two almost equal terms. This is 
clearly a recipe for bad numerical work. 

Examination of the lowest-order calculation of Y, for the qq annihilation term shows 
some of the sources of the problems: 



Y 



C fdiidi: 



Qt 



6 6 



if + (Q2 



u 



-S{s + i + u - Q^ 



-25(l-xi/ei)5(l-X2/e 
-5(1 - xi/^i 



HQ^Ql) - ^ 









_1 - 


+ 





(13) 



5(1-0:2/6) 



Here xi and X2 are the longitudinal momentum fractions of the Drell-Yan pair. The first 
term contains the usual perturbative calculation of the differential cross section, and the 
other 3 terms give the negative of its low Qt asymptote. The intrinsic rate of fall off of the 
cross section with Qt is given by the explicit 1/Qt factor which is present in the parton 
cross section. But an extra fall off is caused by the fact that the parton densities are probed 
at larger fractional momenta when Qt is increased. 

Some symptoms of the problems can already be seen. One is that the first subtraction 
term, on the second line of Eq. (|l^), changes sign at large Qt- the extrapolation of a positive 
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cross section becomes negative. The second is the plus distribution in the third hne; if the 
parton distributions are steeply falling, the plus distributions give a misleading size for the 
integrand. This last effect really indicates that there is an additional scale in the process, so 
that the relevant scales are: 

• The transverse momentum Qt of the Drell-Yan pair. 

• The invariant mass Q of the pair. 

• The increase AQ of Q that is necessary to make the typical parton densities in the 
factorization formula decrease by a factor 2. 

We believe the overall approach of a subtraction method is correct: W correctly represents 
the physics at low Qt, and we do not wish to give up a method that uses the intuitive notion 
of /c^-dependent parton densities. We therefore cannot expect to obtain a perfect estimate of 
the large Qt cross section from W alone. The idea of adding a correction term F is a good 
way of combining the information in standard fixed order calculations with the resummed 
calculations. 

But improvements in its implementation are needed. We suggest the following strategies 
that could be tried, individually or even in combination: 

• Multiply W by an ad hoc factor F{Qt/M). Correspondingly the formula for the 
subtraction term in Y will also have the same factor. The parameter M is in principle 
arbitrary, and it should be chosen so that the fall off in the modified W term mimics 
that of the actual cross section. The cut-off function obeys F(0) = 1, so that the small 
Qt behavior is unchanged, and the function should be zero for large Qt- 

• Change the argument of W from Qt to some other function of Qt- One possible choice 
would be Q'j, = Qt/{^ — Qt/M), where M is again a parameter to be chosen. One 
would replace W by zero if Qt > M. The effect of the variable change is to leave W 
unaltered at small Qt and to give a more rapid fall off at large Qt- Again one would 
make an identical redefinition in the subtraction term in Y. 

• Redefine the + distributions such as those in Eq. ([I3|), by: 



(The usual definition has zq = 1.) 

In each case we have a generalized renormalization-group invariance of the exact cross section 
under changes of the parameter M or zq- But approximations obtained by truncation of a 
perturbation series are invariant only up to a term of order the first uncalculated correction. 
The aim is to choose the parameters on physical grounds to be such as to keep these higher 
order terms small, to eliminate their reason(s) for being large. 




(14) 
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1.6 Applications 



Beyond production, in its present form, the CSS formalism can be applied in hadron- 
hadron collisions whenever the final state is colorless. The phenomenological significance of 
this "transverse momentum resummation" ranges from Drell-Yan pair production, through 
lepton pair production via and bosons di-gauge boson (e.g. photon or Z° boson 
pair) production |10|, [Tl|], to Higgs production |T3|. In recent years it was tested in 

hadronic processes taking place at fixed target (e.g. in DY photon and diphoton production) 
T5| and collider energies (e.g. in DY, ^ Z^, and diphoton production). It was apphed 



for different hadronic initial states in pion-nucleon, proton-nucleon, and proton-anti-proton 
collisions. It was also modified and tested for DIS processes [0]. Finally, since was first 
devised for the calculation of the energy correlation of jets in e~^e~ collisions 0, it can be 
used in jet production at lepton colliders. Such a wide variety of applicability, and good 
agreement with existing experimental results for different processes, colliders, center of mass 
energies, and initial states gives us a confidence in the resummed predictions for the LHC 



2 Higgs production 

At the LHC the SM Higgs boson will be mainly produced through the gluon fusion subprocess 
via a top quark loop: gg (top quark loop) — > HX The Higgs boson can be detected 



in its if ^ 77 decay mode, if its mass is in the 100-150 GeV range |18|. If the Higgs mass 
is higher than about 130 GeV then its H Z^Z^* decay mode is the cleanest and most 
significant JlSl. To distinguish these signals from the substantial QCD background, besides 
the sharp peak in the invariant mass distribution, the most straightforward measurable to 
use is the transverse momentum. According to earlier studies, a statistical significance on 
the order of 5-10 can be reached for the inclusive if ^ 77 signal, actual values depending 
on luminosity and background estimates. Once their transverse momentum distribution is 
reliably predicted, the difference in the Qt of the signal and background can be utilized to 
devise kinematic cuts to enhance the statistical significance of the signal. After the discovery, 
when determining the properties of the Higgs boson, besides the total cross section and the 
invariant mass distribution, the simplest and most fundamental measurable to use is the 



transverse momentum. For a recently proposed new detection mode, H — > 77jet, in Ref. [|19 
it was also found that in order to optimize the significance it is necessary to impose a 30 
GeV cut on the transverse momentum of the jet, or equivalently (at NLO precision), on the 
Qt of the photon pair. With this cut in place extraction of the signal in the Higgs plus jet 
mode requires the precise knowledge of both the signal and background distributions in the 
mid- to high-Qr region. 

To reliably predict the Qt distribution of Higgs bosons at the LHC, especially in the 
low to mid Qt region where the bulk of the rate is, the effects of the multiple soft -gluon 
emission have to be included. In practice, performing soft gluon resummation within the 
CSS formalism is equivalent to the determination of the A^"'\ B^'^\ and C*-"-* coefficients 
and the Y part at some order in as- One way to calculate the coefficients is to expand 
the resummed part in terms of the strong coupling (expanding the exponent an the Wilson 



73 



coefficients), and compare the expansion with a fixed order calculation. Luckily, because 
of its significance, there was much work done on fixed order QCD corrections to Higgs 
production in the gg — *• HX channel. These fixed order QCD corrections are known to 
substantially increase the rate: by about 70 to 100 percent, depending on the Higgs mass, 
at O(a^)|20|, g^], and by an additional 50 to 70 percent at 0{aj) [0. It is expected that 
the calculation of even higher order corrections is important to reliably predict the cross 
section. In Ref. it was shown that multiple soft-gluon emission dominates the higher 
order corrections. 



2.1 Soft gluon resummation for the gg HX channel 

Resummed calculations, taking into account the soft-gluon effect, attempted to estimate the 
size of the non-calculated higher order corrections , as well as provide a reliable shape of 



the Higgs transverse momentum distribution |]T2|, [T^ . Our present approach surpasses these 
by calculating the Qt distribution while including 0{a'^) terms in the Sudakov exponent, 
using the state of the art matching to the latest fixed order distributions, using a QCD im- 
proved gluon- Higgs effective coupling ||2^, and using an improved non-perturbative function. 
We utilize the approximation that the object which couples the gluons to the Higgs (the top 
quark in the SM), is much heavier than the Higgs itself. This approximation is not essential 
to our calculation and can be released by the calculation of the further Wilson coefficients 
keeping the relevant masses. The heavy quark approximation in the SM was shown to be 



reliable within 5 percent for nin < 2mt [|22| , |20| , p5| , and still reasonable even in the range 



of niH ~ 2mt It has also been shown that the approximation remains valid for the Qt 



distribution in the large Qt region, provided that niH < rrit and Qt < mt In this work 
we assume that the approximation is valid in the whole Qt region. Unlike the authors of 
Ref. we do not assume that the QCD corrections to the gg HX cross section can be 
factorized into a multiplicative term in the heavy quark limit in all orders of a^. We can 
release this approximation because the CSS formalism, by definition, systematically incor- 
porates higher order fixed order corrections via the definition of the Sudakov exponent and 
the Wilson coefficients as perturbative expansions ^f. 

Multiple soft-gluon emission affects the gg HX cross section when the transverse 
momentum of the Higgs is low, while for high transverse momenta the hard gluon radiation 
is dominant. Thus, using the CSS formalism we resum large logs of the type Iii^Q/Qt) in 
the low Qt region, and we match the resummed result to the fixed order calculation which 
is valid for high Qt 0. We also include the qg and qq subprocesses which, depending on 
the Higgs mass, together constitute to 10 percent of the total rate . 



The resummed differential cross section of the Higgs boson production in hadronic colli- 
sions is written as 



dQ-dydQl ="°T"^^^ -"^-^ 
X 1^ j d%e''^^-%g{h,Q,xi,X2,C,,2,3) 
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X Wl''{b,Q,Xi,X2) 
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Y{Qt,Q,Xi,X2,C4)] . 



(15) 



The kinematic variables Q, y, and Qt are the invariant mass, rapidity, and transverse mo- 
mentum of the Higgs boson in the laboratory frame. The parton momentum fractions are 
defined as xi = e'^Q/y/S, and X2 = e^^Q/v^, with being the center-of-mass (CM) 
energy of the hadrons hi and /i2. The lowest order cross section, with the QCD corrected 
effective coupling of the Higgs boson to gluons is 



V2GFal{Q' 
5767r 



(16) 



where is the Fermi constant and is defined in Ref. [^. The renormalization group 
invariant kernel of the Fourier integral Wgg and the regular terms Y{QT,Q,Xi,X2,Ci) (to- 
gether with the variables 6* and Ci to C4) are defined in Ref. [|l^]. In addition to Ref. |]l3l 
we use the process independent coefficient 



= C, 



6T 5 

] Nc Nf 

,36 12/ ^ 18 



(17) 



in the expansion of the A function {Nc 
active quark flavors). 



3 the number of colors and Nf = 5 the number of 



2.2 Some numerical results 

The resummation formula is coded in the ResBos Monte Carlo event generator ^, which 
uses the following electroweak input parameters Gp = 1.16639 x 10~^ GeV~^,mz = 



91.187 GeV, m-yv = 80.36 GeV. The NLO expressions for the running electromagnetic and 
strong couplings a{fi) and as{fi) are used, as well as the NLO parton distribution function set 
CTEQ4M (defined in the modified minimal subtraction, MS, scheme). The renormalization 
and factorization scales are set equal to the Higgs invariant mass. In the choice of the non- 



perturbative parameters we follow Ref. p8|]. Since we are not concerned with the decays of 
Higgs bosons in this work, we do not impose any kinematic cuts. 

Fig. I displays production cross sections at the LHC, calculated in the SM as the function 
of the Higgs mass. Our 0{al) curve agrees well with the result in Ref. [23]. The ratio of 
the fixed order 0{a^) (dashed) and the lowest order C(a^) (dotted) curves varies between 
2.35 and 2.00. We note that less than 2 percent of the 0{al) corrections come from the qg 
and qq initial states for Higgs masses below 200 GeV. The resummed curve is slightly (5 
to 6 percent) higher than the 0{al) one, as expected based on the findings that the CSS 
formalism preserves the fixed order rate within the error of the matching (which is expected 
to be higher order) 0. The resummed rate is close to the 0{al), because we used the 0{al) 
fixed order results to derive the Wilson coefficients which are utilized in our calculation. In 
Ref. the O^a^) corrections were utilized to show that in the high Qt region the O^a^) 
to 0{al) _ft"-factor is nearly constant and is about 1.5 (for CTEQ4M parton distributions). 
Based on this finding we also plot the C(a^) curve rescaled by 1.5, to illustrate the size of 
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Figure 4: SM Higgs boson production cross sections at the LHC via top quark loop as the 
function of the Higgs mass, with QCD corrections calculated by soft-gluon resummation (solid), 
at fixed order 0{al) (dashed), and without QCD corrections at 0{a1) (dotted). The 0{al) 
curve is scaled by 1.5 (dash-dotted, c.f. Ref. ||22[]) to estimate the 0{a^) result. 
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Figure 5: Higgs boson transverse momentum distributions at the LHC, illustrating the effect of 
various contributions of the CSS formalism. Among the pair of curves the ones which peak lower 
are calculated by using A^^'"^^ and the others by A^^\ Additionally, the solid curves include 5^ 
and C^^\ the dashed ones C^^\ and the dotted ones B^^\ The dot-dashed curves only include A 
coefficients. The lower portion of the figure shows the same curves normalized to the area under 
the lower peaking solid curve, to compare the changes in the shape. 
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the 0{ag) corrections and to establish the normahzation of our resummed calculation among 
the fixed order results. 

Fig. ^illustrates the effect of the various contributions of the CSS formalism on the Higgs 
boson transverse momentum distribution. The lower peaking curves, drawn by the same type 
line, contain the coefficients A^^''^\ The others lack the A^'^^ coefficient. Comparison of pairs 
of curves shows that the log multiplied by the A^^^ coefficient increases the rate by about 
10% around the peak, and decreases it in the mid-Q^ region. The figure also shows that 
exclusion of the B^^^ term leads to about 40% decrease around the peak, and an increase 
away from it. Finally, the exclusion of the C*-^-* coefficient decreases the overall rate by about 
a factor of 2, coupled with some shape change similar to the B^^^ case. 

Fig. H displays transverse momentum distributions of Higgs bosons produced at the 
LHC. The Qt distribution is calculated under several different assumptions for the non- 
perturbative sector of the CSS formalism, in order to span the range of scatter of these 
different predictions. In Fig. ^ the (solid) curve using the result of the latest 3-parameter 
fit for the non-perturbative function is shown. (The actual values of the parameters 
used are: gi = 0.15 GeV^, g2 = (Cf/Ca) * 0.48 GeV^, and c/3 = -0.58 GeV^^) Also shown 
the (dashed) curve using the result of the latest 3-parameter fit of Ref. (The values 

were used are: gi = 0.24 GeV^, and ^2 = (Cf/Ca) * 0.34 GeV^.) We plotted the (dotted) 
curve using the previous 3-parameter fit of Ref. [^, as well. In the lower portion of the 
figure we show the ratios of the different curves to the solid curve. From this we conclude 
that the three different parameterizations differ by about 5 percent, at most, in the relevant 
Qt region. At Qt = 10 GeV, in the region of the peak of the distribution, the difference is 
about 2 percent. 

In Fig. the solid curve is the same as in Fig. In this figure results using g2 = 
(Cf/Ca) * 0.33 GeV^, and ^2 = (Cf/Ca) * 0.69 GeV^ values are plotted (dashed). These 
g2 values are 3 a deviations from the central value g2 = (Cf/Ca) * 0.48 GeV^ of the new 
3-parameter fit. Also shown a curve with g2 = 0.48 GeV^, where the assumption that the 
non-perturbative parameter g2 scales by Ca/Cf for the gluonic initial state was not utilized. 
The lower portion of the figure shows that the ratios of the various curves to the solid curve 
do not deviate from 1 significantly except in the very low Qt (< 5 GeV) region. 
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A Comparison of the Predictions from Monte Carlo 
Programs and Transverse Momentum Resummation 



C. Balazs, J. Huston, I. Puljak 



Abstract 

Monte Carlo event generators are being increasingly relied upon for predictions of 
experimental observables at colliders. In this section, the parton shower formalism for 
Monte Carlos is compared to that of analytic resummation calculations. Predictions 
for the transverse momentum distribution of bosons, photon pairs, and the Higgs 
boson are compared for the Tevatron and the LHC. [| 

1 Introduction 

Parton shower Monte Carlo programs such as PYTHIAp|, HERWIG0 and ISAJET^ are com- 
monly used by experimentalists, both as a way of comparing experimental data to theoretical 
predictions, and also as a means of simulating experimental signatures in kinematic regimes 
for which there is not yet experimental data (such as the LHC). The final output of the 
Monte Carlo programs consists of the 4-vectors of a set of final state hadrons; this output 
can either be compared to reconstructed experimental quantities or, when coupled with a 
simulation of a detector response, can be directly compared to raw data taken by the ex- 
periment, and/or passed through the same reconstruction procedures as the raw data. In 
this way, the parton shower programs can be more useful to experimentalists than analytic 
calculations. Indeed, almost all of the physics plots in the ATLAS physics TDR [5] involve 
comparisons to PYTHIA (version 5.7). 

For many physical quantities, the predictions from parton shower Monte Carlo programs 
should be nearly as precise as those from analytic theoretical calculations. This is expected, 
among others, for calculations which resum logs with the transverse momentum of partons 
initiating the hard scattering. In the recent literature, most calculations of this kind are 
either based on or originate from the formalism developed by J. Collins, D. Soper, and G. 
Sterman (CSS) 0, which we choose as the analytic 'benchmark' of this section. In this case, 
both the Monte Carlo and analytic calculations should accurately describe the effects of the 
emission of multiple soft gluons from the incoming partons, an all orders problem in QCD. 
The initial state soft gluon emission can affect the kinematics of the final state partons. This 

more complete treatment of this subject can be found in Ref. |]. 
^See, for example, the discussion in the previous section. 
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may have an impact on the signatures of physics processes at both the trigger and analysis 
levels and thus it is important to understand the reliability of such predictions. The best 
method for testing the reliability is the direct comparison of the predictions to experimental 
data. If no experimental data is available for certain predictions, then some understanding 
of the reliability may be gained from the comparison of the predictions from the two different 
methods. 



For technical reasons, the initial state parton shower proceeds by a backwards evolution, 
starting at the large (negative) scale of the hard scatter and then considering emissions 
at lower and lower (negative) virtualities, corresponding to earlier points on the cascade 
(and earlier points in time), until a scale corresponding to the factorization scale is reached. 
The transverse momentum of the initial state is built up from the whole series of splittings 
(and boosts). The showering process is independent of the hard scattering process being 
considered (as long as one does not introduce any matrix element corrections), and depends 
only on the initial state partons and the hard scale of the process. 

In the case of parton showering, the leading order collinear singularities factorize for cross 
sections in the collinear limit 



where Ain+i is the invariant amplitude for the process producing n partons and a gluon, gg is 
the strong coupling constant, pb and Pg are the 4- momenta of the daughters of the n'th parton 
a (i.e. a splits into b and g, and when they are collinear then Pb-Pg — > 0). Finally Pg^-a{z) 
is the DGLAP splitting kernel belonging to the a ^ g splitting. The leading order collinear 
singularities can be factorized into a Sudakov form factor: S = 1 — P{no emission) = 
exp{— J dp^ /p^ J dzP{z)). The distribution 1 — S can be used to generate the for the first 
emission and hence for the whole cascade. The formalism can be extended to soft singularities 
as well by using angular ordering. In this approach, the choice of the hard scattering is based 
on the use of evolved parton distributions, which means that the inclusive effects of initial- 
state radiation are already included. What remains is therefore to construct the exclusive 
showers. 

Parton showering resums primarily the leading logs, which are universal, i.e. process 
independent, and depend only on the given initial state. In this lies one of the strengths of 
Monte Carlos, since parton showering can be incorporated into a wide variety of physical 
processes. An analytic calculation, in comparison, can resum all logs. For example, the CSS 
formalism sums all of the logarithms with lp\ in their arguments, where (for Higgs boson 
production) Q is the four momentum of the Higgs and pt is its transverse momentum. As 
discussed in the previous section on resummation, all of the 'dangerous logs' are included in 
the Sudakov exponent, which can be written in the impact parameter (6) space as: 



2 Parton Showering and Resummation 



lim \Mn^\^ ^ gl{Ph-Vg) ^Pg^a{z)\M. 
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with the A and B functions being free of large logs and perturbatively calculable: 

n=l V ^ / n=l V ^ / 

These functions contain an infinite number of coefficients, with the A^'^^ coefficients be- 
ing universal while the B^"'^ are process dependent. In practice, the number of towers of 
logarithms included in the Sudakov exponent depends on the level to which a fixed order 
calculation was performed for a given process. For example, if only a next-to-leading order 
calculation is available, only the coefficients A^^^ and B^^^ can be included. If a NNLO calcu- 
lation is available, then A^'^^ and B^"^^ can be extracted and incorporated into a resummation 
calculation, and so on. This is the case, for example, for boson production. So far, only 
the A^^\ A^'^'^ and B^^^ coefficients are known for Higgs production but the calculation of 
is in progress. [H If we try to interpret parton showering in the same language, which is 
admittedly risky, then we can say that the Monte Carlo Sudakov exponent always contains 
a term analogous to It was shown in Reference that a suitable modification of 

the Altarelli-Parisi splitting function, or equivalently the strong coupling constant as, also 
effectively approximates the A^'^^ coefficient. f\ 

In contrast with the shower Monte Carlos, analytic resummation calculations integrate 
over the kinematics of the soft gluon emision, with the result that they are limited in their 
predictive power for inclusive final states. While the Monte Carlo maintains an exact treat- 
ment of the branching kinematics, in the original CSS formalism no kinematic penalty is 
paid for the emission of the soft gluons, although an approximate treatment of this can 
be incorporated into its numerical implementations, like ResBos |Q. Neither the parton 
showering process nor the analytic resummation translate smoothly into kinematic configu- 
rations where one hard parton is emitted (at large px)- In the Monte Carlo matrix element 
corrections, and in the analytic resummation calculation matching is necessary. This match- 
ing is standard procedure for resummation calculations and matrix element corrections are 
becoming increasingly common in Monte Carlos P, p!0|. 



With the appropriate input from higher order cross sections, a resummation calculation 
has the corresponding higher order normalization and scale dependence. The normalization 
and scale dependence for the Monte Carlo, though, remains that of a leading order calcu- 
lation. The parton showering process redistributes the event particles in phase space, but 
does not change the total cross section (for example, for the production of a Higgs boson). 

In particular, one quantity which should be well-described by both calculations is the 
transverse momentum (pt) of the final state electroweak boson in a subprocess such as 
qq —>■ WX, ZX or gg —>■ HX, where most of the pt is provided by initial state parton 
showering. The parton showering supplies the same sort of transverse kick as the soft gluon 
radiation in a resummation calculation. Indeed, very similar Sudakov form factors appear 
in both approaches, with the caveats about the A^''^^ and i?^*^-* terms mentioned previously. 



■^This is rigorously true only for the high x or \/r region. 

"^Technically, one could add the branching for q — > g+Higgs in the shower, which would have the capability 
of increasing somewhat the Higgs cross section; however, the main contribution to the higher order X-factor 
comes from the virtual corrections and the 'Higgs Bremsstrahlung' contribution is neglible. 
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This correspondence between the Sudakov form factors in resummation and Monte Carlo 
approaches may seem trivial, but there are many subtleties between the two approaches 
relating to both the arguments of the Sudakov factors as well as the impact of subleading 
logs p. 

At a point in its evolution corresponding to (typically) the virtuality of a few GeV^, the 
parton shower is cut off and the effects of gluon emission at softer scales must be parame- 
terized and inserted by hand. This is similar to the (somewhat arbitrary) division between 
perturbative and non-perturbative regions in a resummation calculation. The parameteriza- 
tion is typically done with a Gaussian formalism similar to that used for the non-perturbative 

in a resummation program. In general, the value for the non-perturbative (kx) needed in 
a Monte Carlo program will depend on the particular kinematics being investigated. In the 
case of the resummation calculation the non-perturbative physics is determined from fits to 
fixed target data and then automatically evolved to the kinematic regime of interest. 

A value for the average non-perturbative fc-r of greater than 1 GeV does not imply that 
there is an anomalous intrinsic fcy associated with the parton size; rather this amount of 
(kx) needs to be supplied to provide what is missing in the truncated parton shower. If the 
shower is cut off at a higher virtuality, more of the 'non-perturbative' kj- will be needed. 



3 Boson Production at the Tevatron 

The 4-vector of a boson, and thus its transverse momentum, can be measured with great 
precision in the e~^e~ decay mode. Resolution effects are relatively minor and are easily cor- 
rected. Thus, the pt distribution is a great testing ground for both the resummation and 
Monte Carlo formalisms for soft gluon emission. The (resolution corrected) pt distribution 
for bosons (in the low Pt region) for the CDF experiment]^ is shown in Figure |l], compared 
to both the resummed prediction from ResBos, and to two predictions from PYTHIA (version 
6.125). One PYTHIA prediction uses the default (rms)Q value of intrinsic kx of 0.44 GeV and 
the second a value of 2.15 GeV (per incoming parton). [] The latter value was found to give 
the best agreement for PYTHIA with the data.Q All of the predictions use the CTEQ4M 



parton distributions [|T2|. The shift between the two PYTHIA predictions at low px is clearly 
evident. As might have been expected, the high p^ region (above 10 GeV) is unaffected 
by the value of the non-perturbative kjn. Note that much of the kx 'given' to the incoming 
partons at their lowest virtuality, Qq, is reduced at the hard scatter due to the number of 
gluon branchings preceding the collision. The emitted gluons carry off a sizeable fraction of 
the original non-perturbative kr- This point will be investigated in more detail later for the 
case of Higgs production. 



^We thank Willis Sakumoto for providing the figures for CDF production 

^For a Gaussian distribution, k™" = 1.13(fcT). 

previous publication indicated the need for a substantially larger non-perturbative (kx), of the 
order of 4 GeV for the case of W production at the Tevatron. The data used in the comparison, however, 
were not corrected for resolution smearing, a fairly large effect for the case otW eiy production and decay. 

*A similar conclusion has been reached for comparisons of the CDF Z*^ px data with HERWIG 
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As an exercise, one can transform the resummation formula in order to bring it to a 
form where the non-perturbative function acts as a Gaussian type smearing term. Using the 
Ladinsky-Yuan parameterization |n] of the non-perturbative function in ResBos leads to an 
rms value for the effective /ct smearing parameter, for production at the Tevatron, of 2.5 
GeV. This is similar to that needed for PYTHIA and HERWIG to describe the Z° production 
data at the Tevatron. 

In Figure |l|, the normalization of the resummed prediction has been rescaled upwards by 
8.4%. The PYTHIA prediction was rescaled by a factor of 1.3-1.4 (remember that this is only 
a leading order comparison) for the shape comparison. 
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Figure 1: The distribution (at low px) from CDF for Run 1 compared to predictions 

from ResBos and from PYTHIA. The two PYTHIA predictions use the default (rms) value for the 
non-perturbative (0.44 GeV) and the value that gives the best agreement with the shape of 
the data (2.15 GeV). 

As stated previously, the resummed prediction correctly describes the shape of the Z° 
Pt distribution at low pt, even with the optimal non-perturbative kr, although there is still 
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a noticeable difference in shape between the Monte Carlo and the resummed prediction. 
It is interesting to note that if the process dependent coefficients (B^^^ and B^"^^) were not 
incorporated into the resummation prediction, the result would be an increase in the height 
of the peak and a decrease in the rate between 10 and 20 GeV, leading to a better agreement 
with the PYTHIA prediction |T5|. 



CDF Preliminary 
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Figure 2: The Z'^ px distribution (for the full range of J9t) ^fom CDF for Run 1 compared to 
predictions from ResBos (curve) and from PYTHIA (histogram). 

The Z° Pt distribution is shown over a wide px range in Figure |^. The PYTHIA and 
ResBos predictions both describe the data well. Note especially the agreement of PYTHIA 
with the data at high pt, made possible by explict matrix element corrections (from the 
subprocesses qq — ^ Z^g and gq Z^q) to the Z^ production process.^ 

^Slightly different techniques are used for the matrix element corrections by PYTHIA |^ and by HERWIG ||l0[| . 
In PYTHIA, the parton shower probabihty distribution is appUed over the whole phase space and the exact 
matrix element corrections are applied only to the branching closest to the hard scatter. In HERWIG, the 
corrections are generated separately for the regions of phase space unpopulated by HERWIG (the 'dead zone') 
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4 Diphoton Production 



Most of the experience that we have for comparisons of data to resummation calcula- 
tions/Monte Carlos deals with Drell-Yan production, i.e. qq initial states. It is important 
then to examine diphoton production at the Tevatron, where a large fraction of the contri- 
bution at low mass is due to gg scattering. The prediction for the diphoton kr distribution 
at the Tevatron, from PYTHIA (version 6.122), is shown in Figure ^ using the experimen- 
tal cuts applied in the CDF analysis It is interesting to note that about half of the 



diphoton cross section at the Tevatron is due to the gg subprocess, and that the diphoton 
Pt distribution is noticeably broader for the gg subprocess than the qq subprocess. 

A comparison of the pr distributions for the two diphoton subprocesses (gg, gg) in PYTHIA 
versions 5.7 and 6.1 is shown in Figure ^. There seems to be little difference in the pt 
distributions between the two versions for both subprocesses. 

In Figure ^ are shown the ResBos predictions for diphoton production at the Tevatron 
from qq and gg scattering compared to the PYTHIA predictions (using the same experimental 
cuts). The gg subprocess predictions in ResBos agree well with those from PYTHIA while the 
qq Pt distribution is noticebly broader in ResBos. The latter behavior is due to the presence 
of the Y piece in ResBos at moderate pt-, and the matching of the qq cross section to the 
fixed order qq — »• 77(7 at high pt- The corresponding matrix element correction is not in 
PYTHIA. It is interesting to note that the PYTHIA and ResBos predictions for gg 77 agree 
in the moderate pt region, even though the ResBos prediction has the Y piece present and 
is matched to the matrix element piece gg — > 77(7 at high px, while there is no such matrix 
element correction for PYTHIA. This shows the smallness of the Y piece for the gg subprocess. 



which is the same conclusion that was reached in Ref. [|17l. One way to understand this is 
recalling that the gg parton-parton luminosity falls very steeply with increasing partonic 
center of mass energy, V^. This falloff tends to suppress the size of the Y piece since the 
production of the diphoton pair at higher pt requires larger Xi, X2 values. In the default CSS 
formalism, there is no such kinematic penalty in the resummed piece since the soft gluon 
radiation comes for "free". (Larger xi and X2 values are not required.) 

A comparison of the CDF diphoton data to NLO [1^ and resummed (ResBos) QCD 



predictions is shown in Figure ^. Plotted are the diphoton mass, the angle A0 between 
the two photons and the transverse momentum kx of the diphoton pair. The transverse 
momentum distribution, in particular, is sensitive to the effects of the soft gluon radiation 
and better agreement can be observed with the ResBos prediction than with the NLO one. 
The data shown in this figure is from an integrated luminosity of 87 ph^^ . A much more 
precise comparison with the effects of soft gluon radiation will be possible with the 2 fh~^ 
or greater data sample that is expected for both CDF and DO in Run 2. 

The prediction for the diphoton production cross section, as a function of the diphoton 
Pt and using cuts appropriate to ATLAS and CMS, is shown in Figure 0. Note that, as at 
the Tevatron, about half of the cross section is due to gg scattering and the diphoton pt 



and the populated region. In the dead zone, the radiation is generated according to a distribution using the 
first order matrix element calculation, while the algorithm for the already populated region applies matrix 
element corrections whenever a branching is capable of being 'the hardest so far'. 
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Figure 3: A comparison of the PYTHIA predictions for diphoton production at the Tevatron for 
the two different subprocesses, qq and gg. The same cuts are applied to PYTHIA as in the CDF 
diphoton analysis. 
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Figure 4: A comparison of the PYTHIA predictions for diphoton production at the Tevatron for 
the two different subprocesses, qq and gg, for two recent versions of PYTHIA. The same cuts are 
applied to PYTHIA as in the CDF diphoton analysis. 
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Figure 5: A comparison of the PYTHIA and ResBos predictions for diphoton production at the 
Tevatron for the two different subprocesses, qq and gg. The same cuts are applied to PYTHIA 
and ResBos as in the CDF diphoton analysis. 
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Figure 6: A comparison of the NLO and ResBos predictions for diphoton production at the 
Tevatron for the diphoton mass, the angle and the transverse momentum of the photon pair 

Kt. 
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distribution from gg scattering is noticeably broader than that from qq production. 




Figure 7: A comparison of the PYTHIA predictions for diphoton production at the LHC for the 
two different subprocesses, qq and gg. Similar cuts are applied to the diphoton kinematics as 
those used by ATLAS and CMS. 

In Figure ^ is shown a comparison of the diphoton px distribution for two different ver- 
sions of PYTHIA, for the two different subprocesses. Note that the Pt distribution in PYTHIA 
version 5.7 is somewhat broader than that in version 6.122 for the case of gg scattering. The 
effective diphoton mass range being considered here is lower than the 150 GeV Higgs mass 
that will be considered in the next section. As will be seen, the differences in soft gluon 
emission between the two versions of PYTHIA are larger in that case. 

In Figure ^ are shown the ResBos predictions for diphoton production at the LHC from 
qq and gg scattering compared to the PYTHIA predictions (using the same experimental cuts). 
Again, the gg subprocess predictions in ResBos agree well with those from while the qq pt 
distribution is noticebly broader in ResBos, for the reasons cited previously. 
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Figure 8: A comparison of the PYTHIA predictions for diphoton production at the LHC for the 
two different subprocesses, qq and gg, for two recent versions of PYTHIA. Similar cuts are applied 
to the diphoton kinematics as are used by ATLAS and CMS. 
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Figure 9: A comparison of the PYTHIA and ResBos predictions for diphoton production at 
the LHC for the two different subprocesses, qq and gg. Similar cuts are applied to PYTHIA and 
ResBos as in the ATLAS and CMS diphoton analyses. 
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5 Higgs Boson Production 



A comparison of the Higgs pt distribution at the LHC, for a Higgs mass of 150 GeV, is 
shown in Figure rU, for ResBos and the two recent versions of PYTHIA. As before, PYTHIA 
has been rescaled to agree with the normahzation of ResBos to allow for a better shape 
comparison. Note that the peak of the resummed distribution has moved to ~ H GeV 
(compared to about 3 GeV for production at the Tevatron). This is partially due to the 
larger mass (150 GeV compared to 90 GeV), but is primarily because of the larger color 
factors associated with initial state gluons {Ca = 3) rather than quarks {Cp = 4/3), and 
also because of the larger phase space for initial state gluon emission at the LHC. 




Figure 10: A comparison of predictions for the Higgs px distribution at the LHC from ResBos and 
from two recent versions of PYTHIA. The ResBos and PYTHIA predictions have been normalized 
to the same area. 

The newer version of PYTHIA agrees well with ResBos at low to moderate pt, but falls 
below the resummed prediction at high pt- This is easily understood: ResBos switches to the 



96 



NLO Higgs + jet matrix element at high px while the default PYTHIA can generate the Higgs 
Pt distribution only by initial state gluon radiation, using as maximum virtuality the Higgs 
mass squared. High px Higgs production is another example where a 2 ^ 1 Monte Carlo 
calculation with parton showering can not completely reproduce the exact matrix element 
calculation, without the use of matrix element corrections. The high pt region is better 
reproduced if the maximum virtuality Q^ax is set equal to the squared partonic center of 
mass energy, s, rather than m'jj. This is equivalent to applying the parton shower to all of 
phase space. However, this has the consequence of depleting the low px region as 'too much' 
showering causes events to migrate out of the peak. The appropriate scale to use in PYTHIA 
(or any Monte Carlo) depends on the pt range to be probed. If matrix element information 
is used to constrain the behavior, the correct high pt cross section can be obtained while 
still using the lower scale for showering. The incorporation of matrix element corrections to 
Higgs production (involving the processes gq qH^qq gH, gg gH) is the next logical 
project for the Monte Carlo experts, in order to accurately describe the high px region. 



A comparison of the two versions of PYTHIA and of ResBos is also shown in Figure 11 



for the case of Higgs production (at a Higgs mass of 100 GeV) at the Tevatron with center- 
of-mass energy of 2.0 TeV. The same quantitative features are observed as at the LHC: the 
newer version of PYTHIA agrees better with ResBos in describing the low pt shape, and there 
is a falloff at high pt unless the larger virtuality is used for the for the parton showers. 
The default (rms) value of the non-perturbative /ct (0.44 GeV) was used for the PYTHIA 
predictions for Higgs production. 

The older version of PYTHIA produces too many Higgs events at moderate px (in compar- 
ison to ResBos) at both the Tevatron and the LHC. Two changes have been implemented 
in the newer version. The first change is that a cut is placed on the combination of z and 

values in a branching: u = — s{l — z) < 0, where s refers to the subsystem of the 
hard scattering plus the shower partons considered to that point. The association with u 
is relevant if the branching is interpreted in terms of a 2 ^ 2 hard scattering. The corner 
of emissions that do not respect this requirement occurs when the value of the spacelike 
emitting parton is little changed and the z value of the branching is close to unity. This 
effect is mainly for the hardest emission (largest Q'^). The net result of this requirement is 
a substantial reduction in the total amount of gluon radiation [|19|. In the second change, 
the parameter for the minimum gluon energy emitted in spacelike showers is modified by an 
extra factor roughly corresponding to the I/7 factor for the boost to the hard subprocess 



frame The effect of this change is to increase the amount of gluon radiation. Thus, the 
two effects are in opposite directions but with the first effect being dominant. 

This difference in the pt distribution between the two versions of PYTHIA could have an 
impact on the analysis strategies for Higgs searches at the LHC. For example, for the CMS 
detector, the higher px activity associated with Higgs production in version 5.7 would have 
allowed for a more precise determination of the event vertex from which the Higgs (decaying 
into two photons) originated. Vertex pointing with the photons is not possible in the CMS 



^"^Such branchings are kinematically allowed, but since matrix element corrections would assume initial 
state partons to have — 0, a non-physical u results (and thus no possibility to impose matrix element 
corrections). The correct behavior is beyond the predictive power of leading log Monte Carlos. 



97 




gg ^H+XatTevatron 

m^= 1 00 GeV,CTEQ4M, Vs=2 TeV 

ResBos98.07.14 

PYTHIA5.7 default 

PYTHIA6. 122 default 

PYTHIA6.122Q^ 



10 15 20 25 30 35 40 45 50 

Pj(GeV) 




p/GeV) 



Figure 11: A comparison of predictions for the Higgs pt distribution at the Tevatron from 
ResBos and from two recent versions of PYTHIA. The ResBos and PYTHIA predictions have been 
normalized to the same area. 
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barrel region, and the large number of interactions occuring with high intensity running will 
mean a substantial probability that at least one of the interactions will produce jets at low 
to moderate Et- This could lead to the wrong vertex being chosen for the Higgs, leading to 
a significant degradation in the 77 effective mass resolution. ||2^ In principle, this problem 
could affect the pt distribution for all PYTHIA processes. In practice, it affects only gg initial 
states, due to the enhanced probability for branching with such an initial state. 

As an exercise, an 80 GeV W and an 80 GeV Higgs were generated at the Tevatron using 



PYTHIAS. 7 |£T|. A comparison of the distribution of values of u and the virtuality Q for the 
two processes indicates a greater tendency for the Higgs virtuality to be near the maximum 
value and for there to be a larger number of Higgs events with positive u (than W events). 

6 Comparison with HERWIG 

The variation between versions 5.7 and 6.1 of PYTHIA gives an indication of the uncertainties 
due to the types of choices that can be made in Monte Carlos. The requirement that u be 
negative for all branchings is a choice rather than an absolute requirement. Perhaps the 
better agreement of version 6.1 with ResBos is an indication that the adoption of the u 
restrictions was correct. Of course, there may be other changes to PYTHIA which would also 
lead to better agreement with ResBos for this variable. 

Since there are a variey of choices that can be made in Monte Carlo implementations, 
it is instructive to compare the predictions for the pt distribution for Higgs production 
from ResBos and PYTHIA with that from HERWIG (version 5.6, also using the CTEQ4M 
parton distribution functions). The HERWIG prediction is shown in Figure |T2| along with the 
PYTHIA and ResBos predictions, all normalized to the ResBos prediction. ^ (In all cases, 
the CTEQ4M parton distribution was used.) The predictions from HERWIG and PYTHIA 6.1 
are very similar, with the HERWIG prediction matching the ResBos shape somewhat better 
at low Pt- For reference, the absolutely normalized predictions from ResBos, PYTHIA and 



HERWIG for the pt distribution of a 150 GeV Higgs at the LHC are shown in Figure |T3. 



7 Non-perturbative kr 

A question still remains as to the appropriate value of non-perturbative to input in the 
Monte Carlos to achieve a better agreement in shape, both at the Tevatron and at the LHC. 



In Figures |1J and |1^ are shown comparisons of ResBos and PYTHIA predictions for the Higgs 
Pt distribution at the Tevatron and LHC. The PYTHIA prediction (now version 6.1 alone) 
is shown with several values of non-perturbative fcy- Suprisingly, no difference is observed 
between the predictions with the different values of kx, with the peak in PYTHIA always 
being somewhat below that of ResBos. This insensitivity can be understood from the plots 
at the bottom of the two figures which show the sum of the non-perturbative initial state 
kr (^ri+fcT2) at Qo and at the hard scatter scale Q. Most of the kx is radiated away, with 
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The normalization factors (ResBos/Monte Carlo) are PYTHIA (both versions) (1.68) and HERWIG (1.84). 



99 




Figure 12: A comparison of predictions for the Higgs pt distribution at the LHC from ResBos, 
two recent versions of PYTHIA and HERWIG. The ResBos, PYTHIA and HERWIG predictions have 
been normalized to the same area. 
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Figure 13: A comparison of predictions for the Higgs pt distribution at the LHC from ResBos 
and from two recent versions of PYTHIA. The ResBos and PYTHIA predictions have their absolute 
normalizations. 
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this effect being larger (as expected) at the LHC. The large gluon radiation probability from 
a gluon-gluon initial state (and the greater phase space available at the LHC) lead to a 
stronger degradation of the non-perturbative fcr than was observed with Z° production at 
the Tevatron. 
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Figure 14: (top) A comparison of the PYTHIA predictions for the pr distribution of a 100 GeV 
Higgs at the Tevatron using the default (rms) non-perturbative kx (0.44 GeV) and a larger value 
(4 GeV), at the initial scale Qq and at the hard scatter scale. Also shown is the ResBos prediction 
(bottom) The vector sum of the intrinsic kx (/^ti+^T2) foi' the two initial state partons at the 
initial scale Qo and at the hard scattering scale for the two values of intrinsic kr- 

For completeness, a comparison of PYTHIA and ResBos is shown in Figure for boson 
production at the LHC. There are two points that are somewhat surprising. There is still 
a very strong sensitivity to the value of the non-perturbative kr used in the smearing, and 
the best agreement with ResBos is obtained with the default value (0.44 GeV), in contrast 
to the 2 GeV needed at the Tevatron. Note again the agreement of PYTHIA with ResBos at 
the highest values of Z° due to the explicit matrix element corrections applied. 
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Figure 15: (top) A comparison of the PYTHIA predictions for the pt distribution of a 150 GeV 
Higgs at the LHC using the default (rms) non- perturbative (0.44 GeV) and a larger value (4 
GeV), at the initial scale Qo and at the hard scatter scale. Also shown is the ResBos prediction, 
(bottom) The vector sum of the intrinsic kx {kTi+kT2) for the two initial state partons at the 
initial scale Qq and at the hard scattering scale for the two values of intrinsic kx- 
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Figure 16: A comparison of the predictions for the px distribution for production at the 
LHC from PYTHIA and ResBos, where several values of kx have been used to make the PYTHIA 
predictions. 
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The sum of the incoming parton kx distributions, both at the scale Qq and at the hard 
scattering scale, are shown in Figure |l^ for several different starting (rms) values of primor- 
dial kx (per parton). There is substantially less radiation for a qq initial state than for a gg 
initial state (as in the case of the Higgs), leading to a noticeable dependence of the Z° pt 
distribution on the primordial kr distribution. 
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Figure 17: A comparison of the total initial state kr {kTi + kT2) distributions for production 
at the LHC from PYTHIA, both at the initial scale Qq and at the hard scattering scale, for several 
(rms) values of the initial state /cy- The mean and rms numbers refer to the values at the hard 
scattering scale. 



8 Conclusions 

An understanding of the signature for Higgs boson production at either the Tevatron or 
LHC depends upon the understanding of the details of soft gluon emission from the initial 
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state partons. This soft gluon emission can be modelled either in a Monte Carlo or in a kr 
resummation program, with various choices possible in both implementations. A comparison 
of the two approaches is useful to understand the strengths and weaknesses of each. The 
data from the Tevatron that either exists now, or will exist in Run 2, will be extremely useful 
to test both approaches. 
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Automatic Computation of LHC Processes 



E. Boos, V. Ilyin, K. Kato, A. Pukhov, A. Semenov, A. Skatchkova 



Automatic computation is a new approach to HEP computing. The first such systems, 
GRACE H, FeynArt/FeynCalc [|| and CompHEP 0, were reported at the 1st Interna- 
tional Workshop AIHENP held on March, 1990 in Lyon-Villeurbanne (France). Under this 
terminology, automatic computation system (ACS), we assume, as a distinguishing feature, 
the generation of the computing code for a specific collision process with the aid of another 
code. 

ACS's are now used widely by phenomenologists for the calculation of many collision pro- 
cesses. For example, the GRACE and CompHEP systems were used in the LEP2 Workshop 
1^, and for evaluation of processes at TeV linear colliders 0. With ACS one can calculate 
all collision processes within a given physical model, where by physical model we mean the 
set of Feynman rules. Recent developments with the LanHEP package |11|] have opened a 



possibility to derive Feynman rules in the form of the ACS intrinsic physical model in a 
fully automatic way, starting from the Lagrangian. Now, not only the Standard Model but 
a number of its extensions, like SUSY models, are implemented in ACS. A general review of 
this new approach is given in these Proceedings by K.Kato together with discussion of main 
directions of the ACS development. Here, we discuss in more detail specifics of the ACS 
applications in LHC phenomenology, and in particular to the evaluation of QCD processes. 

To close this preview we list below the main ACS options in order to provide an idea for 
users of what is available: 

selection of physical model (Lagrangian) and hard subprocess, 

Feynman diagram generation, 

m) generation of the code for matrix element, 

IV) convolution with parton distributions, 

generation of kinematics (phase space parameterization) with regularization of kine- 
matical peaks, 

VI) integration over the phase space (evaluation of cross section), 

vn) generation of events at partonic level, including the interface to hadronization tools. 
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1 The problem of multiparticle final states: why auto- 
matic computations? 

We start from the problem of the accurate evaluation of hard subprocesses in the case of 
multiparticle final states. 

When physicists simulate HEP processes with such generators as PYTHIA ||^, ISAJET 
or HERWIG § they use a data base of hard subprocesses implemented in these packages. 
It means that a) matrix elements are stored as formulas, and b) a knowledge about the 
behaviour of matrix elements as phase space integrands are coded in the form of modelling 
functions in order to get a fast generation of the partonic events. One can note that these 
data bases include a rather simple variety of subprocesses, mainly of the 2 — 2 type. 

If one tries to include a hard subprocesses with 3, 4 and more particles in the final state, 
large problems appear. Indeed, the size of matrix elements increases very fast. For example, 
in the 2 — 4 case, the size of the code for evaluation of helicity amplitudes for one subprocess 
is at the lOO's of Kbyte level. However, the main problem lies elsewhere; it is impossible 
to construct an analytical formula for matching peaks and other structures of the rather 
singular behaviour of matrix elements. In the 2 — > 3 case, phase space has 4 dimensions 
plus two for convolution with the PDF's; in the 2 — 4 case 7+2 dimensions are present, 
and so on. As a result, the set of kinematical singularities has, as a rule, a very complicated 
positioning in the multidimensional phase space. This particular problem was not solved 
accurately, e.g., when the Zbb final state was implemented in PYTHIA 5.7. 

Let us discuss further this somewhat delicate point. It is necessary to integrate the 
squared matrix element over the phase space in order to obtain the cross section. Precise 
information about the behaviour of the integrand then is necessary for further event gen- 
eration. This information can be obtained at the step of the phase space integration. The 
problem is that the integrand, as a rule, has a singular behaviour with sharp kinemati- 
cal peaks connected with different denominators (propagators) of Feynman diagrams. This 
problem is caused, in particular, by the circumstance that one has to take into account 
nonzero masses of particles in many important cases, especially if accurate calculations are 
needed. The masses of elementary particles can have extremely small values, e.g. the masses 
of the 1st generation quarks (few MeV), and can also be zero (for the photon and gluon). 
At the same time, other parameters are of the order of a hundred GeV, e.g. masses of W 
and Z bosons and t-quark. Moreover the collision energy can also have a very large value, 
e.g. 14000 GeV for LHC processes, and some other important variables, like the transverse 
energy of jets, are at the hundred GeV scale or even greater. This huge scale interval for 
different parameters causes serious computational problems which result in the appearance 
of sharp peaks for the integrand. So, at the step of phase space parameterization, one has 
to include a regularization of the integration measure in order to smooth the singularities of 
the integrand. 

LHC phenomenology requires the computation of a wide spectrum of hard subprocesses 
with 3,4 and even more particles (partons) in the final state. This is a common need for all of 
the physics working groups: QCD, EW, Higgs, SUSY etc. These requirements are especially 
common for new physics searches. Furthermore, a major challenge results from background 
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analyses, where QCD subprocesses play a major role with, in many cases, multiparton fi- 
nal states. As a rule, for each LHC discovery reaction, one should calculate several QCD 
processes giving both irreducible and reducible backgrounds. The parton-shower generation 
of multiparticle final states is usually utilized in this situation. However, this can be too 
crude an approximation for many important studies leading to sometimes grossly unreliable 
predictions. 

We emphasize that ACS can give the possibility to compute accurately a variety of LHC 
processes (and in particular QCD processes) with 3, 4 and more bodies in the final partonic 
state. Indeed, the first problem (the size of the matrix element computing code and the 
difficulty to obtain the exact matrix elements) is solved in ACS by the automatic generation 
of the corresponding code. This step is fast and pain-free from the viewpoint of the user. The 
second problem (the accurate integration over the multidimensional phase space) is solved 
in ACS by the generation of kinematics where the necessary regularizations are included. 
For example, in CompHEP the user has to list a set of singular propagators using the menu 
system. After that, the code for kinematics (with regularizations) is generated automatically. 
In GRACE, a library of kinematics (with regularizations) is used and the user has to make the 
necessary choices. Thus, the high art (mathematics and programming), needed to elaborate 
the sharp peaks, is enclosed in a form hidden from the user, giving him a possibility to 
compute complicated processes. 

At the step of integration over the phase space, ACS uses adaptive Monte Carlo inte- 
grators (VEGAS [g in CompHEP, and BASES |3l in GRACE). To match the complete 
set of singularities, the multichannel MC approach |jlO| is utilized. As a result, the phase 
space grid is created with an accurate mapping of the singular behaviour of the matrix ele- 
ment. This complex body of information (let us call it MEgrid) has a rather large size that 
rapidly increases with the number of phase space dimensions. One can consider MEgrid 
as a multidimensional analog of the modelling function used in PYTHIA and other similar 
packages for the effective generation of partonic events. Of course, this information can not 
be expressed in analytical form. It is necessary to point out also that the convolution with 
parton distributions should be made at the same stage as the integration over the phase 
space. Indeed, the contributions of different subspaces (in particular different kinematical 
peaks) can depend largely on the partonic coUison energy, s, resulting from the information 
stored in MEgrid. 

ACS can be considered as a tool for the automatic generation of the data base of hard 
subprocesses for physical generators like PYTHIA, ISAJET and HERWIG. However, it is 
difficult to imagine that the data base created can be implemented in the code of these 
generators. This is due, first of all, to the size of the generated codes. Thus, we propose a 
two stage approach. At the first stage, ACS is used resulting in a cross section and MEgrid for 
the subprocess under evaluation. This can be stored in a special LHC data base. This data 
base can be used for the effective generation of partonic events. In GRACE, it is available 
with the SPRING |13] generator, and in CompHEP by a relatively straightforward procedure 
and an effective generator is under construction). The output is a partonic event flow that 
can be used as an input for physical generators like PYTHIA, ISAJET and HERWIG; this 
is second stage of the full simulations. At this stage partons (quarks and gluons) should be 
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hadronized and unstable particles decayed. We note that in PYTHIA there exists a rather 
flexible interface for such a two stage approach, the option for inclusion of external processes 
through the routine PYUPEV. 

This is a general view on the way in which ACS (GRACE and CompHEP in particular) 
can be used for the simulation of LHC processes. Below we discuss some specific features of 
this technology with special attention to QCD aspects. 

2 General Considerations about GRACE and Com- 
pHEP 

With CompHEP and GRACE the user can evaluate hard subprocesses at the tree level, i.e. 
Feynman diagrams are generated without loops. This corresponds to the basic request for 
LHC phenomenology. However, it is well known that QCD next-to-lcading corrections are 
large, as a rule, for LHC processes. In many cases these corections can be accounted for in 
the form of so-called K-factors and one can include them easily in tree level calculations. 
Nevertheless, in many important cases an explicit evaluation of higher order corrections is 
necessary. At this moment it is not clear how to automate calculations of LHC processes at 
NLO level. The problem is connected, in particular, with the circumstance that different re- 
summations of large logarithms should be included in order to get reliable NLO predictions. 
The interface between resummation techniques and event generators is under intensive dis- 
cussion now, and at the present Workshop also. We note in this respect, that the GRACE 
package inchidcs the code for the generation and evaluation of one-loop diagrams. 

The user interface should provide the possibility to calculate complicated processes for 
users not experienced in programming. CompHEP has a (graphical) menu driven system 
where the user proceeds through all steps of the calculation without any programming. In 
GRACE, the user needs to write a few simple interface routines. 

The information on the GRACE system and its products can be found at 

http: / /www-sc. kek.jp/minami/ 

The code of CompHEP is free for users and one can take it from the following Web page 

http://theory.npi.msu.su/comphep 
where the user's manual is available in PS format (see also hep-ph/9908288). The CompHEP 
package, adapted for LHC processes (sec next section) is installed on the SUN platform 

/afs/cern.ch/cms/physics/COMPHEP/v33-SUN 
and on the PC/Linux platform 

/afs/cern.ch/cms/physics/COMPHEP/v33-Linux 

The interface between CompHEP and PYTHIA has beencreated with the corresponding 
code available from the address: 

/ afs/cern .ch / cms/ physics/com p-pyth 
where one can find a short description in the file README. With this interface, the partonic 
event flow for any processes calculated with CompHEP can be sent to PYTHIA to generate 
physical events. 
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3 QCD aspects in automatic computations 



In this section we discuss the treatment of QCD effects in the case of automatic computations, 
and consider CompHEP options as an example. As has been discussed above, CompHEP 
calculates only at tree level, and so at leading order (LO). Thus, the main problems concern- 
ing an accurate accounting of QCD effects are outside the discussion. Nevertheless, some 
important QCD dependencies can not be avoided even at tree level and the corresponding 
options are available for users. These aspects are: a) parton distributions, b) QCD scale, 
and c) running strong coupling constant. 

Parton distributions. 

In CompHEP the specification of initial states in the collision process under evaluation 
can include the convolution with structure function. So, in the case of hadron collisions, the 
cross section is evaluated as an integral 



where fi are the corresponding parton distributions, a is the partonic cross section and Q is 
the QCD scale. 

In CompHEP v. 33, installed at CERN (see address above), parton distributions from 
two pdf families are implemented, MRS and CTEQ, and in particular the following versions: 
1) MRS(A') and MRS(G) H, 2) CTEQ41 and CTEQ4m H. Note that CTEQ41 is a LO 
parametrization, while in all others the evolution of parton distributions is realized in the 
next-to-leading (NLO) approximation. 

In addition, a special interface is available to include a user's defined parton distribution. 
By this way one can implement the most recent parametrizations (at this moment CTEQ5 
and MSRT). See the CompHEP user's manual for the corresponding procedure (section 



Choice of QCD scale. 

The factorization theorem states that parton distributions depend not only on the Bjorken 
variable x but also on some parameter Q which characterizes the energy (or momentum) 
scale at which the QCD effects give the main contribution to the hard subprocess. This 
parameter is set by the user for each specific QCD process. It is possible to set a fixed scale 
or a running scale. In the later case, can be a squared linear combination of any set of 
initial and outgoing particles momenta, for example, {pi — Ps)^, {pi—Ps — P4)^, {Ps + P^}^ and 
so on (initial and outgoing momenta enter with opposite signs). The corresponding settings 
are made through the option QCD SCALE of the numerical menu. 

Running ag. 

It is the nature of strong interactions that there is no absolute normalization of the cor- 
responding coupling constant. This is in contrast to the value 1/137 for the electromagnetic 
constant known with high accuracy from classical electrodynamical experiments. Instead, 
we have a function for as rather than a constant. Even in the leading order approximation, 
a^^ = 67r/[(33 — 2nf) logQ/A^"-^^], where Q is the QCD scale of the hard subprocess under 




3.6.2). 
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the evaluation with A the so-called QCD fundamental parameter. Then, nj is the number of 
parton flavours with masses lower than Q. The Uf dependence in QCD parameter A matches 
the quark mass threshold effects. 

In the version of CompHEP installed at CERN, the running as is realized in LO, NLO 
and NNLO. All of the corresponding formulas are based on the choice of A^^-* (see Review of 
Particle Physics [0 p. 81. The user can find the corresponding switch in the option QCD 



SCALE in the numerical menu. 



Therefore, to evaluate QCD processes with CompHEP, one has, first of all, to fix the 
normalization of as- The popular normalization point is the mass of Z boson, Q = Mz- 
By changing the parameter A'-^-', the user should set the strong coupling at the appropriate 
value, say as{Mz) = 0.118. Then, the user has to choose the order for the running (LO, 
NLO or NLO). Finally, the user has to define the QCD scale Q, which will be used both for 
the evaluation of as at this scale and in the parton distributions. 

Thus, the complete LO calculations of LHC processes are available, with the matrix 
element, parton distributions and running strong coupling constant calculated in the low- 
est order of the perturbation theory. This is a self-consistent starting point in the phe- 
nomenological analysis; when/where higher order corrections are available, all elements of 
the calculation can be calculated at the higher order and then compared to the leading order 
result. 

However, it is also common for phenomenologists use a mixed approach, with the matrix 
element evaluated at LO but the parton distributions and running as taken in NLO approx- 
imation. Surely, only a part of the NLO corrections is accounted for in this case. We note 
that this option is also available for users in CompHEP calculations. 



4 Partonic Subprocesses 

When hadronic collision processes are evaluated, especially in the case of a large number 
of final state particles, one serious problem is the large number of contributing partonic 
subprocesses. This occurs because of the quark and gluon content of the initial hadrons 
and CKM quark mixing. For example, at LHC energies, 180 subprocesses contribute to the 
W+2jets and 292 subprocesses to the W+3jets production (taking into account only quarks 
of the first two generations ). During this workshop a new method has been proposed to avoid 
a multiplication of channels due to the mixture of quark states |T^. The method leads to a 



simple modification of the rules for the evaluation of the cross sections and distributions. It 
is based on the unitary rotation of down quarks, thus providing the transportation of mixing 
matrix elements from vertices of Feynman diagrams to the parton distribution functions. As 
a result, one can calculate cross sections with a significantly smaller number of subprocesses 
contributing. For the examples mentioned above, one needs to evaluate (with the new rules) 
only 21 and 33 subprocesses, respectively, in order to compute the cross sections for the 
W + 2jets and W + Sjets processes. The matrix elements of the subprocesses are calculated 
without quark mixing, but with a modified PDF convolution which now depends on the quark 
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mixing angle and the topologies of the gauge invariant classes of diagrams contributing to 
the subproccsscs. The method proposed has been incorporated into the CompHEP program 
and checked with many examples. 

5 PEVLIB - library of LHC processes 

Now the library of CompHEP based event generators for LHC processes has been started at 
the address: 

/afs/cern.ch/cms/ physics/PEVLIB 

The following QCD processes are stored already in this library: Zbb, Wbb, tibb and some 
others. In the corresponding directories (with the names literally corresponding to the final 
states) unweighted events are stored (see the files README in these directories for details 
about evaluation of the corresponding samples of events) . 

Together with the CompHEP-PYTHIA interface code (sec discussion above) these event 
files can be used for full LHC simulations with the help of PYTHIA package and detector 
simulation software in the standard way. 

Let us discuss the process Zbb in order to supply more details. In the directory 

/afs/cern.ch/cms/physics/PEVLIB/Z_b_b 
the file __pevZbb includes about 200000 unweighted events with the final state Zbb. Each 
event includes the Lorentz momenta of all particles in the initial and final states. In the 
present version of this library, there is no information about the color fiow in the event. Thus, 
only the Independent Fragmentation Model can be used for the hadronization. Of course, the 
user can use the Lund model; for this one has to define the corresponding color flows by hand 
in the routine RYU REV. The same remark is valid also for FSR (final state radiation), what 
is switched off by default in CompHEP-PYTHIA interface. In the same time ISR (initial 
state radiation) is switched on automatically. 

Note that the user can generate more events than stored in the library. In the corre- 
sponding subdirectories (indicated in the file README) the generators are stored in the 
form of the executable code (at this moment for SUN platform only). These generators 
are the corresponding CompHEP codes for the process with the proper set of kinematical 
regularizations. 

The library PEVLIB is under construction now. New processes will be added. The 
structure and user's interface will be developed. 
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Monte Carlo Event Generators at NLO 



J. Collins 



Abstract 

I am concerned here with QCD calculations for processes with a hard scattering 
— production of heavy particles, jets, etc. The most accurate calculations are by the 
"analytic" methods. But the most useful calculations for direct comparison with data 
are done by Monte-Carlo event generators, and these are limited in accuracy. In par- 
ticular, there is as yet no known method of systematically improving the Monte-Carlo 
calculations by incorporating the non-logarithmic parts of higher order pcrturbative 
corrections. This creates limitations on the analysis of future data. Therefore I sum- 
marize some ideas for remedying the situation. 

Factorization for inclusive processes 

• Normal proofs of factorization are for inclusive cross sections. 

• To get the simplifications in the factorization formula, as compared with the exact 
cross section, one makes suitable approximations. 

• The approximation is to the hard scattering part i7 of a cross section, as in: 




Figure 1: 

which is a graph for the DIS cross section. The kinematics of the external lines of H 
are changed to massless on-shell partons with zero transverse momentum. Also the 
internal (light parton) lines of H are made massless. 

• The approximation is correct to the leading power of rn? /Q"^, where m is a typical 
hadronic scale and Q is the scale of the hard scattering (e.g., Q"^ is the virtuality of 
the virtual photon in DIS) . 
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• Subtractions are applied to the hard scattering to cancel double counting. A high- 
order graph for the hard scattering has subtractions that correspond to smaller hard 
subgraphs (and hence smaller regions of momentum space). 

• The exact form of the subtractions corresponds to the approximations made for the 
smaller regions. 

Monte-Carlo event generators at NLO (and beyond) Current event generators es- 
sentially use the leading order for the hard-scattering coupled to an algorithm that approxi- 
mates the exclusive structure of the low virtuality parts of graphs for the cross section. The 
algorithm is in an improved leading logarithm approximation. 

With the exception of the recent paper by Friberg and Sjostrand previous attempts, 
e.g., [0, at incorporating NLO corrections have tended to implement them by a reweighting 
of the events generated by showering from the LO matrix elements. Normal NLO "analytic" 
calculations for the hard-scattering coefficients for inclusive scattering are not usable as they 
stand in a Monte-Carlo event generator, because they involve singular distributions. 

Here I summarize some ideas 0] to remedy this situation. I have applied them to one 
specific case in DIS in Ref. |Q, but I think they can be generalized: 

• Separate groups of events are generated with LO and NLO hard scattering coefficients. 

• To obtain the NLO coefficients, the same general ideas are used as in inclusive hard 
scattering. 

• The methods are applied both to the hard scattering itself and to the showering kernels. 

• However, because we are now working with exclusive processes, the form of the approxi- 
mations is different. The approximated graphs must satisfy the following requirements: 

— Exact 4-momentum conservation must be obeyed for each subprocess (hard scat- 
tering or one stage of the showering). I.e., pf = p^, where pf and p^ are the total 
initial and final momenta the the subprocess. 

— The approximation may change the momenta of the internal lines but it must 
preserve the momenta of the external lines. This avoids the problem with having 
singular distributions. 

• A cut-off is applied, for otherwise the approximated graphs give ultra-violet divergences 
when integrated to large transverse momentum. The kinematics associated with exact 
momentum conservation do provide a cutoff, but such a cutoff tends to violate factor- 
ization of the momentum-space integrals. So a separate artificial cutoff is better, and 
probably makes for a better implementation of the algorithm. 

• Conventionally, in a Monte-Carlo a sharp cutoff is used. But a smooth cut-off will 
probably be better for numerical work. It will also make it easier to get positive cross 
sections. 
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Figure 2: Born graph for DIS 



• The dependence on the cutoff is a generahzed renormahzation-group transformation, 
and the exact cross section is independent of the form of the cutoff. 

• Separate exphcit soft factors are needed, as in the factorization theorem for the qt 
distribution for the Drell-Yan process; unhke the case of an inclusive cross section the 
cancellation of the soft region is not complete. This issue is not treated in Refs. [|, |^, 
but will need further work, which is in progress. 



Example of application to DIS []^ The parton model graph of Fig. |^ is combined with 
showering to give a LO cross section that can be summarized as 

(Tlo = Born graph x initial-state showering x final-state showering. (1) 

The NLO cross section is obtained from subtracted one-loop graphs, and the hard- 
scattering coefficient is of the form 






X Jacobian x cutoff function. 




The first term is an unsubtracted NLO graph. The subtraction corresponds to the approx- 
imation made in LO. Above the horizontal line, the following replacement for the internal 
momentum is made: 



/ massless on-shell, zero transverse momentum 

Lorentz-transformed momentum with correct final state. 



(3) 
(4) 



The first step is the standard approximation. The second step is needed to obtain the cor- 
rect kinematics with conservation of 4-momentum. It is somewhat non-trivial to implement 
consistently since the subtraction term is needed when it is far from the collinear region. The 
bulk of my work in Refs. 0, |^ is about constructing a definite correct and consistent imple- 
mentation. Correctness here means that the subtraction term is the order as approximation 
to the showering in the LO Monte Carlo algorithm. 



119 



Differences between conventional "analytic" method and Monte-Carlo method 

These differences can be illustrated by the following mathematical example. Warning: In a 
number of respects this example is over-simplified. For example, it does not take account of 
the parton densities. As explained in Ref. IQ, the parton densities in the Monte Carlo are 
not in the usual MS scheme. However, they are related to them by definite formulae. 
Let the unapproximated unsubtracted integrand at NLO be 

da 



d^kr (Q2 + kl + m?) {kl + m?) ' 

The conventional approach obtains the hard-scattering coefficient by setting the mass m to 
zero. The integral is then infinite, and a subtraction is inserted which consists of a delta 
function at /c^ = with an infinite coefficient. The result is a + distribution, which has a 
finite integral: 



da ( l\ 



d^kr Q^ + k^\k^J, (g2 + A;2 ) A;2 



C5^'\kT), (6) 



where C is an infinite constant, defined with the aid, for example, of dimensional regular- 
ization. 

In my new approach the subtracted integrand is 

da f{kT/^i)J{kT)_ 



d^kr {Q^ + kl)kl k^ 

where f{kT/ ^i) is the previously mentioned cutoff function, which is unity for small kx and 
zero for large kx- As usual /i is the factorization scale. The factor J{kT) symbolizes the 
Jacobian that is necessary in the transformation from the variables appropriate for generation 
of events and the variables for the measured particles. 
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NLO and NNLO Calculations 



V. Del Duca and G. Heinrich 



1 The NLO and NNLO program 

QCD calculations of multijet rates beyond the leading order (LO) in the strong coupling 
constant are usually quite involved. Nowadays we know (see Section |T]^) how to perform 
in general calculations of the next-to-leading order (NLO) corrections to multijet rates, and 
almost every process of interest has been computed to that accuracy. Instead, the calculation 
of the next-to-next-to-leading order (NNLO) corrections is still at an organizational stage 
and represents a main challenge. Why should we perform calculations which are technically 
so complicated ? 

The general motivation is that the calculation of the NLO corrections allows us to es- 
timate reliably a given production rate, while the NNLO corrections allow us to estimate 
the theoretical uncertainty on the production rate. This is achieved by reducing the depen- 
dence of the cross section on the renormalization scale, /i^, and for processes with strongly- 
interacting incoming particles the dependence on the factorization scale, fip, as well. 

An example is the determination of as from event shape variables in e~^e~ 3 jets 
Although the NLO contributions to e'^e~ 3 jets have been computed for some time 
now ^, the NNLO contributions have yet to be obtained. A calculation of these NNLO 
contributions would be needed to further reduce the theoretical uncertainty in the determi- 
nation of as- 

We present in Section |1.1| an additional motivation for performing QCD calculations at 
NNLO, which is specific to the LHC program, and we outline in Section |L2| how QCD 
calculations at NLO are implemented and in Section [L.3| how QCD calculations at NNLO 
could be performed. 



1.1 Higgs production 

The main goal of the LHC physics program is the investigation of the mechanism of the 
electroweak symmetry breaking, and namely the search and detection of the Higgs boson. If 
the Higgs boson is light (100 GeV < mn < 140 GeV), the rare decay channel in two pho- 
tons, H-^ 77, provides the best signature @, |, ||, |^. Since the signal-to-background ratio 
is quite low (~ 7%), the analysis of this channel promises to be demanding. Our theoretical 
understanding of signal and background is still preliminary: the NLO QCD corrections to 
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the signal are known to be quite large (C(100%)) 0. Also the QCD background pp 77, 
given at LO by the parton subprocess qq 77, is known to NLO 0, with the full NLO 
fragmentation contributions having just been evaluated |T^. However, pp 77 receives 
a sizeble contribution from NNLO corrections because of the large gluon luminosity of the 
subprocess gg — > 77 appearing first at NNLO Thus in order to have a reliable theoret- 

ical estimate both the signal and the background need to be determined at least to NNLO 
accuracy. 

In order to improve the signal-to-background ratio, Higgs production in association with 
a high transverse energy (Et) jet, pp 



12 . This 



H jet — >• 'j'jjet, has been considered 
production rate offers the advantage of being more flexible in choosing suitable acceptance 
cuts to curb the background, pp H jet is known to LO exactly [|l^, while the NLO 
corrections have been computed in the infinite top-mass limit. The NLO corrections 
to the signal are large. However, it is believed that the background, pp — 'j'y jet, can 
be more reliably calculated because LO production is dominated by the parton subprocess 
19 ^ Qll) which benefits from the large gluon luminosity, while the subprocess gg 
gj'j, which is believed to dominate the NNLO contribution, yields a comparatively smaller 
contribution ||lT], |15[. Thus, even though the signal, pp ^ H jet, likely needs be computed 
at NNLO accuracy, it should suffice to evaluate the background, pp — > •yy jet, at NLO. The 
NLO corrections to the background, though, have yet to be computed, with the appropriate 
QCD amplitudes having just been evaluated lH 



1.2 NLO algorithms and one- loop amplitudes 



In recent years it has become clear how to construct general-purpose algorithms for the 
calculation of multijet rates at NLO accuracy. The crucial point is to organise the can- 
cellation of the infrared (i.e. coUinear and soft) singularities of the QCD amplitudes in a 
universal, i.e. process- independent, way. The universal terms in a NLO calculation are 
given by the tree-level splitting |]1^ and eikonal 
structure of the poles of the one-loop amplitudes | 



IS 



IS] functions, and by the universal 
The universal NLO terms and 
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the process-dependent amplitudes are combined into effective matrix elements, which are 
devoid of singularities. The various NLO algorithms (phase-space slicing pO|, |2^, subtrac- 
tion method 
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|, dipole formalism and subtraction-improved slicing ^6[) provide 
different methods to construct the effective matrix elements. These can be integrated, an- 
alytically or otherwise numerically, in four dimensions. The integration can be performed 
with arbitrary experimental acceptance cuts. 

Then the remaining work to be performed to calculate a production rate at NLO is 
to compute the appropriate tree and one-loop amplitudes. To compute n-jet production 
at NLO, two sets of amplitudes are required: a) n-particle production amplitudes at tree 
level and one loop; b) {n + l)-particle production amplitudes at tree level. If the one-loop 
amplitudes are regularised through dimensional regularization (DR) by evaluating them in 
d = 4 — 2e dimensions, it suffices at NLO to compute them to 0{e^). As an example, in Fig. |I] 
we show the squared matrix elements which are required to calculate the NLO corrections 
to e"''e~ — s> 3 jets. 
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Figure 1: Squared matrix elements which contribute the NLO corrections to e+e" —>■ 3 jets. The 
dashed line represents a massive vector boson, 'j*,W,Z. a) interference term between one-loop 
and tree amplitudes. The final-state partons are a qq pair and a gluon. b) square of a tree 
amplitude. The final-state partons are a qq pair and two gluons, or two qq pairs. In figure b) one 
of the partons is unresolved. 



Efficient methods based on the color decomposition ^7\, ^ of an amphtude in 

color-ordered subamplitudes, which are then projected onto the helicity states of the ex- 
ternal partons, have largely enhanced the ability of computing tree |3ll| and one-loop p2| 
amplitudes. Accordingly, tree amplitudes with up to seven massless partons Q and 
with a vector boson and up to five massless partons |34| have been computed analytically. 
In addition, efficient techniques to evaluate numerically tree multi-parton amplitudes have 
been introduced ||35|, and have been used to compute tree amplitudes with up to eleven 
massless partons ^6|. The calculation of one- loop amplitudes can be reduced to the cal- 
culation of one-loop n-point scalar integrals |3^, ^ 
the computation of one-loop amplitudes with four massless partons 
boson and three massless partons [H0|. However, one- loop scalar integrals present infrared 



The reduction method 

m 



37 allowed 



and with a vector 



divergences, induced by the massless external legs. For one-loop multi-parton amplitudes, 
the infrared divergences hinder the reduction methods of ref . [p7| , |38| . This problem has been 
overcome in ref. |^T|. Accordingly, one- loop amplitudes with five massless partons ^ 
and with a vector boson and four massless partons p3 have been computed analytically. 



The reduction procedure of ref. has been generalised in ref. , where it has been shown 
that any one-loop n-point scalar integral, with n > 4, can be reduced to box scalar integrals, 
and that in the reduction of n-point tensor integrals, all higher dimensional (rf > 4 — 2e) 
n-point integrals with n > 4 drop out. The calculation of one-loop multi-parton amplitudes 
thus can be pushed a step further in the near future. 



1.3 NNLO calculations 



Eventually, a procedure similar to the one followed at NLO will permit the construction 
of general-purpose algorithms at NNLO accuracy. It is mandatory then to fully investi- 
gate the infrared structure of the phase space at NNLO. The universal pieces needed to 
organise the cancellation of the infrared singularities are given by the tree-level double- 
splitting 1^ ^ double-eikonal [0, ^] and splitting-eikonal ^ functions, by the 

functions, and by the universal structure of the 



one-loop splitting ^ 
poles of the two-loop amplitudes 



and eikonal 



These universal pieces have yet to be assembled 



together, to show the cancellation of the infrared divergences at NNLO. 
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c) , d) 




Figure 2: Squared matrix elements which contribute the NNLO corrections to e"^e^ 3 jets. 
The dashed line represents a massive vector boson, 7*,W,Z. a) interference term between 
two-loop and tree amplitudes, and b) square of a one-loop amplitude. In figures a) and b) the 
final-state partons are a qq pair and a gluon. c) interference term between one-loop and tree 
amplitudes. The final-state partons are a qq pair and two gluons or two qq pairs. One of the 
partons is unresolved, d) square of a tree amplitude. The final-state partons are a qq pair and 
three gluons, or two qq pairs and a gluon. Two of the partons are unresolved. 



Then to compute n-jet production at NNLO, three sets of amphtudes are required: a) 
ra-particle production amphtudes at tree level, one loop and two loops; b) {n + l)-particle 
production amplitudes at tree level and one loop; c) {n + 2)-particle production amplitudes 
at tree level. In Fig. we show the squared matrix elements which are required to calculate 
the NNLO corrections to e^e" —>■ 3 jets. In DR at NNLO, the two-loop amplitudes need be 
computed to C(e°), while the one-loop amplitudes must be evaluated to C(e^) [|^, The 
main challenge is the calculation of the two-loop amplitudes. At present, the only amplitude 
known at two loops is the one for V ^ qq with V a massive vector boson, which 
depends only on one kinematic variable. It has been used to evaluate the NNLO corrections 
to Drell-Yan production and to deeply inelastic scattering (DIS) No two- loop 

computations exist for configurations involving more than one kinematic variable, except 
in the case of maximal supersjTiimetry |56]. One of the main obstacles for configurations 
involving two kinematic variables is the analytic computation of the two-loop four-point 
functions with massless external legs, where significant progress has just been achieved. 
These consist of planar double-box integrals non-planar double-box integrals single- 



box integrals with a bubble insertion on one of the propagators and single-box integrals 
with a vertex correction 



30 



The two-loop four-point functions with massless external 
legs are needed for the computation of two-loop amplitudes in parton-parton scattering. 
Finally, the topical processes considered above, i.e. e~^e~ 3 jets and pp — > H jet sport 
configurations involving three kinematic variables and require the analytic computation of 
two-loop four-point functions with a massive external leg. The two-loop four-point functions 
of this kind with up to five different denominators have been derived recently [^, while 
those with six and seven different propagators are still missing. Another obstacle is the 
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color decomposition of two- loop amplitudes, which is not known yet. Substantial progress is 
expected in the next future on all the issues outlined above, which should make the present 
note soon outdated. 

Finally, we mention that in the factorization of coUinear singularities for strongly- 
interacting incoming particles, the evolution of the parton distribution functions (prf/'s) in 
the jet cross section should be determined to an accuracy matching the one of the parton 
cross section. For hadroproduction of jets computed at NLO, one needs the NLO, or two- 
loop, evolution of the pdf's [^, ^ Accordingly for hadroproduction at NNLO the 
evolution of the pdf^s should be computed to NNLO, or three-loop, accuracy. Except for 
the lowest five (four) even-integer moments of the three-loop non-singlet (singlet) splitting 
functions no calculation of the NNLO evolution of the pdf^s exists yet. However, NNLO 
analysis based on the finite set of known moments have been performed for xF^ |68[ and 
F2 (non-singlet |69| and singlet |7^). Furthermore, in ref. |^ a quantitative assessment 
of the importance of the yet unknown higher-order terms has been performed, with the 
conclusion that they should be numerically significant only for Bjorken-scaling x < 10~^. 

The computation of the evolution of the pdf^s at NNLO accuracy is a main challenge 
in QCD. The NLO computation was performed with two different methods, one using the 
operator product expansion (OPE) in a covariant gauge |63|, the other using the light-cone 
axial (LCA) gauge with principal value prescription [^. However, the prescription used in 
ref. has certain shortcomings. Accordingly, the calculation has been repeated in the LCA 

Hi 



gauge using a generally correct prescription 



which makes it amenable to extensions 



beyond NLO. On the other hand, using the OPE method, there had been a problem with 
operator mixing in the singlet sector, which has been fixed ||65|] only recently, and the result 
finally coincides with the one obtained in the LCA gauge in ref. [Q. Thus the calculation of 
the pdf evolution at NLO accuracy is fully under control. Recent proposals for a calculation 
beyond NLO include extensions of the OPE technique, which have been used to recompute 
the NNLO corrections to DIS and a computation of the pdf evolution by combining the 
universal gauge- invariant coUinear pieces WM. For the two-loop pdf evolution, e.g., they are 



the coUinear pieces mentioned at the beginning of this section. 
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Jet Algorithms 



S. Catani and D. Zeppenfeld 



1 Jet algorithms 

Jet algorithms have the task to assign streams of hadrons in hard scattering processes to 
a jet, who's energy, mass and momentum can then be related to a collection of partons in 
a perturbative QCD calculation. Although, at the experimental level, jets can be defined 
by using rather general and intuitive procedures, if we would like to compute jet cross 
sections and properties by using QCD perturbation theory, the definition of jets should fulfil 
stronger constraints to guarantee its perturbative safety. Perturbative safety means that the 
definition has to be infrared safe (jet properties cannot depend on the presence of arbitrarily 
soft partons), collinear safe (jet properties cannot change by replacing a parton with a set 
of collinear partons carrying the same total momentum) and collinearly factorizable (jet 
properties should be insensitive to partons radiated colhnearly to the beam direction). If 
the jet definition is not perturbative safe, we cannot perform calculations order-by-order in 
perturbation theory because they are affected by uncancelled infrared divergences. Of course, 
in the full QCD theory (i.e. beyond perturbation theory) the perturbative divergences are 
regularized by small physical cutoffs related to hadron masses and the finite experimental 
resolution (size of calorimeter cells, energy thresholds, etc.). The physical cutoffs are always 
present, independently of the jet definition. However, in the case of a perturbative safe 
definition, their effects are suppressed by some inverse power of the jet transverse energy 
Et, and thus they can be made small by sufficiently increasing Et- This power suppression 
is not at work in perturbative unsafe jet definitions, where the effects of the small physical 
cutoffs can amount to large corrections (of order unity) to the perturbative results. Thus, 
perturbative safe definitions are preferred. 

Jet algorithms start from a list of "particles" which we would like to freely associate 
with calorimeter cells or hadrons at the experimental level, and with partons in a QCD 
calculation. Each particle i carries a 4-momentum pf, which we take to be massless. The 
task is to select a set of particles which are emitted close to each other in angle and combine 
their momenta to form the momentum of a jet. The selection process is called the "jet 
algorithm" , the momentum addition rule is called the "recombination scheme" . 

Let us start with a discussion of recombination schemes. In a hadron collider environment 
the arbitrary boost of the hard scattering system along the beam axis needs to be taken into 
account in the definition of angles to ensure collinear factorizability. This is achieved by using 
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transverse momentum, = \Jpi + Py-, rapidity y = l/2\og{E + p^) / {E —pz) and azimuthal 
angle of the massless particles as the kinematic variables. When adding the massless 4- 
vectors of particles we obtain massive objects which only approximately correspond to the 
massless partons which we would like to associate with jets at tree level. 

One popular choice, the Snowmass convention leaves the question of jet mass open, 
by only defining total transverse energy, rapidity and azimuthal angle of a set of parton 
momenta, as the Et weighted sums of the individual particle variables. For the original 
massless particles Ej-i = p^i and r/j = yi. The corresponding recombined variables for a 
cluster of particles are then given by the total transverse energy 

ET = Y.ETi, (1.1) 

i 

the cluster pseudorapidity 

V = E^V., (1.2) 

and the azimuthal angle of the cluster 



= E^'/>- (1-3) 

Note that the designation of rj as pseudorapidity is purely conventional. It corresponds to 
neither the pseudorapidity nor the rapidity of the massive cluster and is approximately equal 
to either only in the limit of small cluster mass (<< Et)- The concomitant loss of Lorenz 
invariance is a serious disadvantage of the Snowmass convention. Another serious problem 
appears in resummation calculations (see Sect. H): the kinematic boundary of jet Et shifts 
(from -\/S/2 in e.g. dijet kinematics) when including additional final state partons. 

Because of these shortcomings we formulate all jet algorithms in the 4-momentum re- 
combination scheme (also called E-scheme) in the following, i.e. the kinematic variables of 
a cluster of particles is given by direct addition of the 4-momenta of the individual massless 
particles: 

P^' = {E,P^,Py,Pz)=Y,P^ . (1.4) 

i 

Since the resulting clusters have a clearly defined mass, we must distinguish transverse energy 
Et from transverse momentum p^, and pseudorapidity rj from rapidity y. We define 

P. = V^. ^ = ta.-^. . = ilogf±f. (1.5) 

Py ^ ^ Pz 

The rapidity y and azimuthal should be used as the legoplot position of the jet when 
calculating its separation from other particles or jets. Auxiliary quantities are 

Pz 

= cos~^ \ = , Et = E sine, T] = -\ogtan-. (1.6) 

/Pi +p; + pi ^ 
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1.1 The kr algorithm 



The /ct algorithm is a successive recombination algorithm. The idea is to recombine 
particles with nearly parallel momenta, beginning with the softest particles in the sample. 
This recombination stops once all clusters of particles are separated by a distance larger 
than D in the legoplot. The kx algorithm starts from a list of protojets and their momenta, 
, which in the beginning consists of the list of all particles: 

(1) For each protojet i define 

di = El^ (1.7) 
and for each pair of protojets z, j define a distance 

dij = min(E|.j, E^^ 



2 772 dy^-yif + 



JJ2 y ' 

(2) Find the smallest of all di and dij and call it dmin- 

(3) If drain IS dij then merge protojets i and j to form a new protojet k of momentum 

Vl=f^^f, (1.9) 



(4) If drain IS di then remove protojet i from the protojet list and move it to the list of 
completed jets. 

(5) Continue with step 1 until the list of protojets is empty. 

The algorithm is infrared safe because it renders all soft partons harmless: it either takes 
them off the protojet list or it combines them with nearby harder partons. The only effect 
of the soft parton then is a shift in the momentum of the recombined cluster. However, 
this shift is small, disappearing in the infrared limit, which guarantees infrared safety. Also 
for coUinear emission the algorithm is safe, because in the limit of zero angle between two 
partons, these two will have the smallest dij and will thus be combined early on in the 
recombination process, thus restoring the momentum of the almost on-shell parton from 
which they originated by splitting. Finally, every original particle is assigned to exactly one 
jet, i.e. there are no splitting/merging issues to be resolved for the fc^ algorithm. 



1.2 ILCA: an infrared safe cone algorithm 

Cone algorithms are intended to cluster all energy within a given radius, i?, around a point 
in the legoplot, to form jets. Naively, this procedure is both infrared and collinear safe: the 
effect of infrared radiation on the cluster momentum vanishes in the infrared limit, and the 
energy measured for the jet is the same whether a single particle is at the core of the cone 
or whether there has been collinear splitting. This naive expectation can easily be violated, 
however, by the prescription for selecting cones. Two examples illustrate this point |]^. 
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Assume that cones are constructed around actual energy depositions only. In Fig. |I|(a) 
two particles are emitted at a distance greater than the cone radius R but smaller than 
2R and therefore are assigned to separate cones, which are then identified as two distinct 
jets. The only difference in Fig. |I](b) is the emission of a third soft particle (a "soft gluon" ) 
between the original two particles. Now the additional cone around the soft energy deposition 
encompasses all three particles and they will be classified as a single jet. The presence of 
a soft particle changes the classification of a hard event: this is an example for an infrared 
unsafe algorithm. In perturbation theory, at sufficiently high order, an arbitrary number 
of soft gluons will be radiated, hence, cones should be allowed anywhere in phase space to 
anticipate this feature of higher order corrections. An arbitrary restriction on allowed cone 
positions may lead to an infrared unsafe algorithm. 




Figure 1: Example for a situation which can lead to infrared problems in an unsafe cone 
algorithm. 

Similarly, an infinite number of coUinear splittings occurs at higher order in perturbation 
theory. A possible collinear problem, resulting from Ex ordering of particles, is illustrated 
in Fig. 1^. The difference between the two situations is that the central (hardest) parton may 
split into two almost collinear partons. On the left-hand-side the distance between the lateral 
partons is larger than R but the three hard partons all fall within a cone of radius R around 
the central parton, which happens to have the largest Enp- As a result, all three partons are 
recombined to a single jet. Collinear splitting renders the right hand parton to be the one 
with the largest Et- Drawing the first cone around the highest Et parton will recombine 
it with the two central partons and a separate jet is likely to be assigned to the remaining 
fourth parton. A differing jet number which depends on the presence or absence of collinear 
splitting must be avoided because the incomplete cancellation of the logarithmic divergences 
of real emission and virtual contributions will lead to a collinear unsafe jet algorithm. One 
can eliminate such ambiguities by making the selection or ordering of jet definition cones 
independent of the Et of individual particles. Also, allowing trial cones anywhere in phase 
space would have made the two situations in Fig. |^ more similar: allowing cones centered 
between the central and the outside partons from the start would lead to a more similar jet 
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Figure 2: Example for a situation which can lead to collinear problems in an unsafe cone 
algorithm. 

identification in the two cases. 

The above considerations lead us to consider a cone algorithm which allows trial cones 
to be positioned anywhere in phase space, irrespective of the transverse momentum carried 
by individual particles or calorimeter cells. We start by formulating this seed-less algorithm 
at the calorimeter level, where the basic entities are calorimeter towers. 

(0) Make a list of all calorimeter towers. 

(1) Select the next tower on the list as the center of a trial cone of radius R. 
Goto (4) if the list of towers is exhausted. 

(2) Add the momenta of all towers inside the trial cone and determine the legoplot position 
{y, 0) corresponding to this momentum. 

(3) If this position is outside the selected tower, discard the trial cone and go to (1). 

If {y, 0) is inside the selected tower, add the set of towers inside the trial cone as a new 
entry to the list of protojets. 

At this stage we have a list of protojets, and we need to split/merge them to make jets. 

(4) Select the highest Et protojet remaining on the list. (If the list is exhausted jet 
identification for the event is complete.) 
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(5) Does the selected protojet share any towers with other protojets? 

(5. a) No: Move protojet to Ust of jets and continue with (4). 

(5.b) Yes: Find the highest Et protojet that shares towers with the selected protojet. 
(Call this the neighbor protojet.) Decide whether the Et in the shared cells is 
greater than a fraction / of the Et in the neighbor protojet. 

(S.b.l) No: Split the shared towers. 

(S.b.l.a) Allocate shared towers to either the selected or the neighbor protojet 
depending on which jet center is closer. 

(S.b.l. b) Calculate the new momenta for the modified protojets, i.e. their Et, and 

legoplot positions {y,(f)). Continue with step (4). 

(5.b.2) Yes: Merge the selected and neighbor protojets to form a new protojet. 

Add the momenta of both protojets and determine the total Et and and the 
legoplot position {y,(j)). Continue with step (4). 

The procedure that defines the list of protojets is infrared and collinear safe. The additional 
steps completely define how to solve the problem of overlapping cones. The critical overlap 
fraction / is a free parameter of the algorithm and may be chosen as 50%, similar to the 
DO choice in run I of the Tevatron (CDF uses 75%). The E'r-ordering of protojets in this 
split /merge step does not introduce collinear problems provided the cone size R is chosen 
sufficiently large. 

The definition of calorimeter towers, i.e. a discretization of {y, (p) space, would be cumber- 
some in a theoretical calculation, and is indeed not necessary. In a perturbative calculation 
at fixed order, the maximal number, n, of partons is fixed. The only possible positions of 
stable cones are then given by the partitions of the n parton momenta, i.e. there are at most 
2" — 1 possible locations of protojets. They are given by the legoplot positions of individual 
partons, all pairs of partons, all combinations of three partons etc. In a perturbative calcu- 
lation, e.g. via a NLO Monte Carlo program, the protojet selection of the seedless algorithm 
(steps (0) to (3) above) can then be replaced as follows: 

(0) Make a hst of aU possible cone centers. These are the legoplot coordinates of all parton 
momenta Pi, of all pairs of parton momenta Pi + Pj, of all triplets of parton momenta 
Pi +Pj +Pk, etc. For each cone center record which set of partons defines it. 

(1) Select the next cone center on the list as the center of a trial cone of radius R. 
Goto (4) if the list of cone centers is exhausted. 

(2) Add the momenta of all partons inside the trial cone and determine the legoplot posi- 
tion {y, 0) corresponding to this momentum. 

(3) If this position is different from the trial cone center, i.e. if the cone center record and 
the list of partons inside the trial cone disagree, discard the trial cone and go to (1). 
If {y, (p) is the trial cone center, add the set of partons inside the trial cone as a new 
entry to the hst of protojets. 
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As before, different protojets may share partons, i.e. they may overlap. The required 
spht /merge step is then identical to the calorimeter level steps (4) and (5), with towers 
replaced by partons as elements of protojets. 

In an actual experiment the number of calorimeter towers may be very large (order 6000 
for tower sizes of Arj x A0 = 0.1 x 0.1 and an r] coverage of ±5 units of pseudorapidity) . 
The calorimeter level algorithm may then be rather slow computationally. The question 
arises whether an acceptable approximation of the seedless algorithm can be constructed, 
analogous to the parton level short-cut, by considering only those towers which have energy 
depositions above a minimal seed threshold. One would like to replace the list of parton 
momenta above by the list of tower momenta with 

PTi > Ex^seed ■ (1-10) 

Since the algorithm is infrared and coUinear safe when Ex^seed = 0, it is always possible to 
chose the seed threshold Ex^seed low enough so that variations of Ex^seed lead to negligeable 
variations in any observable under consideration. 

One would like to include in the determination of jet momenta all towers, of course, 
which lie inside the cone of radius R around the protojet axis. This requires an additional 
iteration of the cone axis in the parton level algorithm when a seed threshold is imposed. 
The steps leading to the definition of protojets can then be modified as follows: 

(0) Make a list of all possible cone centers. These are the legoplot coordinates of all 
parton/tower momenta pi with pTi > Ex^seed, of all pairs of such parton/tower momenta 
Pi + pj, of all triplets Pi + pj + pk, etc. 

(1) Select the next cone center on the list as the center of a trial cone of radius R. 
Goto (3) if the list of cone centers is exhausted. 

(2) Add the momenta of all partons/towers inside the trial cone (also those with pTi < 
ET,seed) and determine the legoplot position (?/, 0) corresponding to this cone momen- 
tum. Use (y, 0) as the new center of the trial cone and iterate this step until the 
position is stable. The set of all towers/partons inside the final trial cone constitutes 
a new protojet. Continue with step (1). 

(3) Eliminate all duplicate protojets, i.e. protojets with an identical set of towers/partons. 

With these changes, the resulting algorithm (named Improved Legacy Cone Algorithm or 
ILCA) is quite close to those used in run I of the Tevatron. The main change is the inclusion of 
midpoints of seeds (the Pi + pj pairs) and of centers of larger numbers of seeds as additional 
seed locations for trial cones. Including these additional midpoints is absolutely crucial 
in perturbative calculations in order to achieve infrared safety (see discussion on Fig. |1[). 
When dealing with data, these effects are somewhat diminished, because with sufficiently 
low seed thresholds Ex^seed, a large number of trial cones will be generated from actual soft 
energy depositions in the calorimeter. However, because these soft energy depositions will 
decide how many jets are reconstructed, one potentially introduces a high sensitivity of jet 



138 



observables to soft hadrons, Monte Carlo modelling of soft particles etc. The inclusion of 
the extra midpoints eliminates these soft effects because observables no longer depend on 
whether soft emission actually took place. 

2 Resummed calculations 

The ILCA and /c^—algorithm eventually lead to jets whose topology is not extremely different 
from that expected on the basis of a naive definition in terms of cones in azimuth-rapidity 
space. This is obviously true for the ILCA, where jets can contain particles whose dis- 
tance is smaller than 2R and have a shape that differs from a cone-shape only because of 
the merging/splitting procedure. In the fc^^-algorithm, jets have no sharp boundaries, but 
opening angles of particles within each jet are, typically, smaller than D and all opening 
angles between jets are larger than D. The detailed jet structure is, however, different in the 
two algorithms. Although both algorithms are perturbative safe, the differences show up in 
higher-order perturbative calculations. 

Higher-order perturbative computations and, in particular, resummed calculations can 
be necessary in special kinematics configurations that lead to large logarithmically-enhanced 
contributions at any fixed order in perturbation theory. Typical examples are calculations 
of jet cross sections near the phase-space boundary and of the fine internal structure of 
jets (shape variables, subjets, etc.). These quantities can be strongly dependent on the 
jet definition. The corresponding perturbative calculations do strongly depend on the jet 
definition, because they are the result of the integration of the QCD matrix elements (which 
do not depend on jets) over phase-space regions whose boundaries depend on the fine details 
of jet kinematics. 

As an example of this strong sensitivity, we can consider the one-jet inclusive cross section 
as a function of the transverse momentum pt of the jet. // the jet variable pt is defined 
by using the 4-momentum recombination scheme (see Eqs. ( |1.4| )-( [L^ )), the kinematical 
boundary is xt < 1, where xt = 2pT/\fS- Close to the boundary ~ 1, the perturbative 
contributions are enhanced by large logarithmic corrections (a5log^(l — x^))" that need to 
be resummed to all orders in as- Techniques to perform this resummation can be developed 
(see below). However, z/the jet variable pt is defined by using a recombination scheme that 
does not conserve the 4-momentum (e.g. the true px in Eq. (|1.5| ) is replaced by the variable 
Et in Eq. (|1 . 1| ) ) , the kinematical boundary for x^, although close to x^ = 1, is not fixed: 
the corresponding large logarithms cannot be resummed because the xy-boundary shifts in 
a complicated manner depending on the number of final-state partons in the calculation @] . 

The feasibility of resummed calculations depends not only on the recombination scheme 
but also on the jet algorithm, as is well known for jets in e^e~ annihilation [^]. The jet 
definition of the fc^-algorithm is inspired by the parton shower picture of jet fragmentation . 
Thus resummed calculations can be carried out by using the analytic version of the recurrence 
techniques used to generate multiparton final states in Monte Carlo parton showers. This is 
demonstrated by explicit calculations of sub jet multiplicity and rates in hadron collisions 0. 
The ILCA has still to be investigated in this respect. The two algorithms differ only slightly 
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at highly inclusive level ^ . Thus, in these cases (such as the large-xr behaviour of the one- 
jet inclusive cross section), resummed calculations in the ILCA should be feasible as in the 
/cT-algorithm. Studies of the internal structure of ILCA jets may instead be more difficult 
since it depends on the procedure to merge/split overlapping jets. 

3 Conclusions 

During the Les Houches workshop discussions were centered on the general properties of jet 
algorithms, in particular their infrared and coUinear safety at the perturbative level. The kx- 
algorithm and the seedless cone algorithm described in Section |1.2| fulfil these requirements. 
Beyond these theoretical concerns there are many experimental issues which need to be 
addressed to obtain a practical algorithm. Among these are ease of energy calibration, effects 
of underlying event and overlapping events in a high luminosity hadron collider environment, 
high jet reconstruction efficiency, and efficient use of computer resources in reconstructing 
jets. These issues have been addressed in a parallel study, during the Run II QCD Workshop 
at Fermilab 0. In particular it has been shown that the ILCA produces small corrections 
only, when compared with the jet algorithms used in run I of the Tevatron. We refer the 
reader to the Proceedings of the Run II Workshop for a detailed study of these effects. 
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A Study of the Underlying Event in Jet and Minimum 

Bias Events 



J. Huston and V. Tano 



Abstract 

In order to determine more accurately the energy contribution in a jet cone due to 
the underlying event, and in order to understand better the ambient event environment 
at both the Tevatron and the LHC, we have studied the energy distribution in a cone 
of radius 0.7 in both jet and in minimum bias events. We have compared the results 
from CDF data from Run lb with results from HERWIG passed through the detector 
simulation program QFL |jl]. 

1 Introduction 

Due to the importance of the inclusive jet cross section as a test of perturbative QCD over 
a wide range of values, it is necessary to carefully consider all systematic effects that 
influence its measurement. In addition to the hard interaction that produces the jets in the 
final state there is also an underlying event, originating mostly from soft spectator parton 
interactions. Because of the softness of the scale, their contribution cannot be perturbatively 
calculated. There may also be a contribution due to semi — hard interactions between 
spectator partons, which create mini — jets at transverse momenta almost large enough for 
perturbative calculations, but much smaller than that of the primary interaction responsible 
for the highest Et jets in the event. This process is known as double parton scattering. Both 
of the above processes, as well as higher order radiation from the 2— i>2 hard subprocess 0, 
contribute to the underlying event. 

The experimental cross sections are most commonly compared to theoretical calculations 
at next-to-leading order (NLO) in the coupling constant as, such as JETRAD or EKS @]. 
At NLO, there can be at most 3 partons in the final state, leading to the presence of either 

2 or 3 jets, depending on whether the third parton is present in the final state and whether 
it ends up being clustered with one of the other two partons. As the jet clustering is based 
on a fixed cone algorithm, the contribution due to the underlying event must be subtracted 
from the jet cone, in order to compare the results with NLO QCD calculations. As will be 
seen below, one of the largest sources of systematic error for the inclusive jet cross section 
at low Et is due to the uncertainty on the subtraction of this underlying event. 
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The current 'paradigm' [| is that the underlying event in jet events is similar to the 
average energy level found in 'active' minimum bias events. Thus, this energy level needs to 
be determined and subtracted from the energy in a jet cone before the jet data is compared 
to NLO theory predictions. CDF assumes an uncertainty of 30% on this underlying event 
subtraction which makes it the dominant error at low Et- 

The flip side of the above 'paradigm' is that the underlying event energy in a jet event 
(once the two leading jets have been subtracted) should be the sum of the minimum bias 
level contribution and the third parton in the NLO calculation. The preliminary results of 
a study in CDF designed to test the accuracy of these assumptions are described in this 
section. 

In this section, the experimental results will be compared to the HERWIG Monte Carlo 
which has the 2^2 matrix elements for jet production, parton showering in the initial 
and final state, and a model for the underlying event. The ultimate result from HERWIG 
consists of the 4-vectors of the final state hadrons. In HERWIG the soft underlying event in 
the hadron-hadron collision is assumed to be a soft collision between the two 'beam clusters', 
which contain the spectators from the incoming hadrons. The model for the simulation of 
underlying event uses the pp event generator from the UA5 Collaboration, which is modified 
to make use of the HERWIG fragmentation algorithm. 

The results from HERWIG can be quoted at the parton level (excluding the soft underlying 
event portion), the hadron level and/or comparisons can be made at the detector level after 
the Monte Carlo hadrons are passed through the CDF detector simulation program QFL 
0. For most of the results that we will be reporting, the QFL comparisons will be crucial. 

To summarize, the purpose of this analysis is to examine in detail the underlying event 
in jet events and to understand whether the amount subtracted from the jet cones is correct 
and whether the uncertainty assumed can be reduced. In addition, a test will be made as 
to how well HERWIG models the underlying event energy in jet events as well as in minimum 
bias events. 

2 Underlying Energy at 90° in jet events 

Events generated with HERWIG were passed through QFL. The HERWIG code was adapted to 
produce the same information as found in the data samples. This information includes the 
energy, the position and the number of calorimeter towers of the jets in the event, together 
with the energy and the number of towers in two cones situated at ±90° in and at the 
same t] as the leading jet. and rj are respectively the azimuthal and polar angle. 

For each jet event, two cones of radius 0.7 at r] = rjieadjet and = (j) Lead jet ± f were 
examined. The energy in each cone was determined for two different calorimeter tower 
thresholds: 50 and 100 MeV. A cut of 100 MeV on tower energies is typically used for jet 
analyses. For most of the comparisons to follow, a 50 MeV cut was used, though, since 
we are interested in possible contributions to the tower energies from a number of different 
sources. The two cones were used to study the underlying event energy because they are 

^Dave Soper claims that this term is vastly overused but we choose to employ it anyway. 
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Figure 1: An example of the cones under investigation in the central calorimeter region. The 
dotted cones are at r/ = rjieadjet and = (pLeadjet ± f , while dashed cones are at i] = ~r]Leadjet 



and 



^Leadjet i 2 ' 



\r]\ < 1., 0" < < 360^ 



supposed to be in a semi-quiet region, far away from the two leading jets, but still in the 
central rapidity region. Given the non-uniform response of the CDF detector as a function 
of rapidity, the latter criterion is essential. The leading jet was required to be in the central 
region, Ir^l < 0.7, the same as in the inclusive jet analysis. No requirement was made on the 
location of the second jet. In Fig |1| the calorimeter central region is shown as 'unrolled'; t] 
ranges are between -1 and +1, while goes from 0° to 360°. The leading jet cone and the 
two cones under study are shown. 

The Et distributions inside the two cones provide an idea of the contribution of the 
underlying event in the jet cone. For each event the cone which has the maximum energy and 
the cone with the minimum energy were labelled. This is useful because NLO perturbative 
corrections to the 2 — > 2 hard scattering can contribute only to one of these two regions 
10]. The difference between the maximum and the minimum cone provides information on 
this contribution, while the minimum cone gives an indication of the amount of underlying 
event. The (roughly constant) underlying event contribution should be suppressed in the 
difference. 

The data were required to have one and only one vertex in order to insure that there is 
only one interaction per event. A similar cut was made in the simulation. In Figure |^ the 
transverse energy inside the two cones (max and min) is plotted as a function of the Ej- of 
the leading jet. It can be clearly observed that HERWIG and the data have a similar behaviour 
for the max and min cone; the min cone stays flat while the max cone increases with the 
Et of the leading jet. The increase of the max cone energy with increasing jet Et is easily 
understandable. What may be surprising is the flatness of the min cone energy as the lead 
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Figure 2: Et inside the max and min cones as a function of the Et of leading jet. Both the 
data and HERWIG distributions are plotted. 

jet transverse energy increases. Contrary to the pronouncements of some of the pohticians 
of our day, a rising tide does not raise all boats (or cones), but instead favors the cone in the 
highest tax bracket. Of course, the division into a max and min cone partially encourages 
this effect through selection. However, the level of flatness is still somewhat surprising. 

It is evident that there is an offset between data and the HERWIG+QFL simulation of 
about 800 MeV for the max cone and 500 MeV for the min cone. If the tower threshold is 
increased from 50 to 100 MeV, the transverse energy decreases by about 180 MeV in the 
data (both cones), while in HERWIG the transverse energy decreases by about 70 MeV in the 
max cone and 40 MeV in the min cone. 

The difference between the transverse energy in the max and in the min cones has a 
similar trend in both data and simulation (Figure There is still an offset but the offset 
decreases to about 300 MeV. It appears that the max-min distribution starts going down 
again at very high Et (perhaps due to kinematic suppression), although the statistics become 
poor. 

In Figure ^ the Et frequency distributions for data and HERWIG+QFL are compared for 
four different jet sub-samples. In this plot, the Et values for max-min are plotted, for both 
data and HERWIG+QFL. The number of entries is scaled to easily allow a direct comparison. 
The Et distribution of the max-min cone for HERWIG+QFL looks very similar to that of the 
data. Here the contribution of the underlying event as minimum bias data is presumably 
removed. 
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Figure 3: The difference between Et inside the max and min cones as a function of the of 
the leading jet. Both the data and HERWIG distributions are plotted. 
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Figure 4: The frequency distribution for the difference between E^ in the max cone and Ex in 
the min cone. Solid line: data, dashed line: HERWIG. The calorimeter tower energy threshold is 
50 MeV. 
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Figure 5: The parton-hadron-detector level for Et in the max cone is plotted as a function of 
the Et of the leading jet. The underlying event in HERWIG is switched off. 

2.1 Parton-Hadron-Detector level 

With HERWIG (unlike the data), we have the advantage of being able to examine the energy 
distributions not only at the detector level, but also at the hadron and parton levels. The 
HERWIG model for the soft underlying event, though, does not show any effect at the parton 
level because the energy contribution is calculated directly at the hadron level. In the 
following discussion, in order to examine the differences between hadron, detector and parton 
level, the underlying event in HERWIG has been switched off. 

Figures |^ and ^ show the transverse energy inside the max and min cones at rj = rjLeadjet 
and (p = (pLeadjet ± | as a function of the leading jet transverse energy at the parton, hadron 
and detector level. The lead jet is always in the central region. Because of the degradation 
due to the detector response, the amount of energy is higher at the hadron level than at the 
detector level. 

It is also interesting to note that the hadron level energy is larger than the parton level 
energy, by the order of several hundred MeV. This is due to hadronization effects of the 
partons produced in or near the lead and second jet cones. Most of the hadronization effects 
come from resonance production (p, Ai, A2, ...) and their subsequent decays. The hadroniza- 
tion effects from the partons inside the jet cone have previously been termed "splashout" . 
It is important to note that this splashout is not currently taken into account in either the 
CDF or DO jet analyses. Both experiments implicitly assume that the hadron and parton 
levels produce the same energy in the jet cone. This is especially relevant for low Et jet 
production. 

In order to evaluate to what level resonance decays influence the energy inside the two 
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Figure 6: The hadron-parton-detector level for Et in the min cone is plotted as a function of 
the Et of the leading jet. The underlying event in HERWIG is switched off. Note that the Et at 
the hadron level is always greater than the Et at the parton level. 

cones at 90° from the leading jet, all resonance decays are switched off and the energy in the 
cones were examined at both the parton and the hadron level. The difference of Et inside 
the min cone (between hadron and parton level) decreases from an average of 300 MeV to 
100 MeV, while the difference in the max cone goes from 500 MeV to 100 MeV. 

3 Underlying Energy in minimum bias events 

The model used in HERWIG to simulate minimum bias events is the same as used for the soft 
underlying energy in hard scattering events. Minimum bias events were studied in order to 
see if the reason for the offset observed between the data and simulation results from the 
HERWIG description of the soft underlying event. The minimum bias events generated with 
HERWIG were passed through the detector simulation program QFL and the information on 
the energy released in the calorimeter towers stored. 

The amount of transverse energy in the calorimeter, in a random cone of radius 0.7 
that is required to be in the central region (|?7| < 0.7), was determined. The transverse 
energy distributions for the two different tower thresholds are summarized in Table |l| where 
a comparison with data also can be found. 

The offset of about 650 MeV between data and HERWIG is slightly higher than the one 
found comparing the min cones in the jet events. 
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Table 1: A comparison of data and HERWIG for minimum bias events. The average amount of 
transverse energy in a cone of radius 0.7 is shown. Thresholds are in Mev, results are in GeV. 



Thresholds 


DATA 


HERWIG 


DATA-HERWIG 


50 


1.05 


.37 


.68 


100 


.92 


.35 


.57 



4 Et summed in the central region (Swiss Cheese) 

For these comparisons, the transverse energy in every calorimeter tower in the central region 
{\rj\ < 1) is summed, excluding the towers in a radius 0.7 from the center of the two (or 
three) most energetic jets in the event: 



Ej-itowersi 



Sumof Et= J2 J2 

towers 2/3jets Ltowers 

where g^^g q\\ ^j-^g towers in a radius 0.7 from the center of the jet. We require 

Erjet > 5 GeV. This configuration has been labelled 'Swiss cheese' 0. 

There are an average of between 2 and 2.5 jets in the central rapidity region, with this 
average having a slight slope as a function of the lead jet transverse energy. The Swiss 
cheese energy in the central region is plotted in Fig |^ at the hadron, parton and detector 
level. The approximate minimum bias level for HERWIG and data is shown with a fiat line 
on the picture. In the simple picture presented earlier, and on which the CDF and DO jet 
analyses are based, the difference between the Swiss cheese energy with two jets subtracted 
and the minimum bias level should be proportional to the NLO (third parton) contribution. 
The Swiss cheese level with three jets subtracted should have little or no NLO contribution 
and can be directly compared to the minimum bias data level. The 3-jet subtracted Swiss 
cheese energy is larger than the minimum bias level and there is a small slope as a function 
of the lead jet Et (the offset varies from 6-8 GeV over the Et range). This indicates perhaps 
that there is more complexity here than in the simple picture. Other possible contributions 
to the Swiss cheese energy include hadronization from the jets ("splashout"), double parton 
scattering and higher order radiation effects. 

As was done for the min and max cone studies, the underlying event in HERWIG can 
be switched off and the hadron/detector level in the Swiss Cheese plots compared when 
the resonance decay is not allowed. At the hadron level, when the resonance decay is not 
allowed, we find about 1.5 GeV energy less then when allowing the resonance decay. This 
implies a 600-700 MeV contribution of splashout per jet (again at the detector level) to the 
Swiss cheese energy. 

The comparison of the Swiss cheese results for the data and HERWIG+QFL is complex 
and its interpretation is continuing. 
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Figure 7: Sum of Et- The two and three most energetic jets in the events are subtracted from 
the total transverse energy in the central calorimeter region. Both data and HERWIG results are 
shown. 



5 Conclusions and where do we go from here 

The energy from the underlying event is not perturbatively calculable and must be subtracted 
from a jet cone in order for comparisons to be made to NLO calculations. Because of the 
ambiguous definition of what constitutes this underlying event, a relatively large uncertainty 
has been assigned to the value of this subtraction. In order to study the underlying event, 
we have considered two cones in the calorimeter far away from the leading jet and we 
examined the energy in the cones, both in the CDF data and with the HERWIG simulation. 
We discovered that both the data and HERWIG exhibited a similar behaviour for the max 
and the min cone; the min cone stays flat, while the max cone increases as a function of 
the leading jet Et- There is an offset, however, of about 500 MeV for the min and of 800 
MeV for the max cone between data and HERWIG. If we examine the difference between the 
max and min cones, where the underlying event energy contribution should be minimized, 
we find very similar distributions for data and HERWIG. In minimum bias, the HERWIG model 
predicts a level of energy substantially below the one found in minimum bias data (400 MeV 
compared to 1 GeV). Part of this difference is due to the lack of any kind of hard interaction 
in the minimum bias model. 

With HERWIG we investigated max/min cone distributions at the parton, hadron and 
detector level and we found out that the energy inside the cones is higher at the hadron then 
at the parton level. This is mainly due to resonance decay. 

An improved understanding of the underlying event is desired for a number of reasons: 

• The underlying event subtraction is the largest uncertainty for the jet cross section 
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at low transverse energy (below 60 GeV). In order to have a good comparison of the 
data with theory, a better understanding of the proper level of this subtraction must 
be obtained. This uncertainty is especially important for the measurement of the jet 
cross section at 630 GeV, since most of the data points are below 60 GeV, and similar 
considerations to those at 1800 GeV also apply. 

• This analysis probes the interface between perturbative and non-perturbative QCD, 
an arena where a great deal of work still needs to be done. 

• The authors of the Monte Carlo programs are trying to predict the environments for 
physics measurements at the LHC. This can be difficult/uncertain without the proper 
understanding of what is happening at the Tevatron. 

This analysis will be extended to the jet and minimum bias data taken at 630 GeV 
by CDF. It will be especially interesting to observe the level of agreement of the HERWIG 
minimum bias predictions with the CDF data, given that the HERWIG model parameters 
were determined from the UA5 taken at a similar energy. It may be that there is an increase 
in the semi-hard component of minimum bias energy when going from 630 to 1800 GeV. 
After the comparisons at 630 and 1800 GeV are complete, extrapolations will made made 
to LHC energies for the underlying event in both jet and minimum bias events. 
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Isolated Photon Production 



S. Frixione and W. Vogelsang 



1 Isolated-photon production 

1.1 General features of photon production at colliders 

When mentioning the photon in the framework of high-energy colhder physics, one is imme- 
diately led to think - with good reasons - to Higgs searches through the gold-plated channel 
H — > 77. However, the production of photons also deserves attention on its own. Firstly, a 
detailed understanding of the continuum two-photon production is crucial in order to clearly 
disentangle any Higgs signals from the background. Secondly, in hadronic collisions, where 
a very large number of strong-interacting particles is produced, photon signals are relatively 
clean, since the photon directly couples only to quarks. Therefore, prompt-photon data can 
be used to study the underlying parton dynamics, in a complementary way with respect to 
analogous studies performed with hadrons or jets. For the same reason, these data represent 
a very important tool in the determination of the gluon density in the proton, g{x). Indeed, 
in recent years almost all the direct information (that is, not obtained through scaling viola- 
tions as predicted by Altarclli-Parisi equations) on the intermediate- and high-a; behaviour 
of g{x) came from prompt-photon production, pp — > 7X and pN — > 7X, in fixed-target 
experiments. The main reason for this is that, at leading order, a photon in the final state 
is produced in the reactions qg jq and qq — > ■jg, with the contribution of the former 
subprocess being obviously sensitive to the gluon and usually dominant over that of the 
latter. It is the 'point-like' coupling of the photon to the quark in these subprocesses that is 
responsible for a much cleaner signal than, say, for the inclusive production of a 7r°, which 
proceeds necessarily through a fragmentation process. 

There is, however, a big flaw in the arguments given above. In fact, photons can also 
be produced through a fragmentation process, in which a parton, scattered or produced in 
a QCD reaction, fragments into a photon plus a number of hadrons. The problem with the 
fragmentation component in the prompt-photon reaction is twofold: first, it introduces in the 
cross section a dependence upon non-perturbative fragmentation functions, similar to those 
relevant in the case of single-hadron production, which are not calculable in perturbative 
QCD and arc, at present, very poorly determined by the sparse LEP data available. Sec- 
ondly, all QCD partonic reactions contribute to the fragmentation component; thus, when 
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addressing the problem of the determination of the gluon density, the advantage of having 
a priori only one partonic reaction {qq 'yg) competing with the signal {qg 7g) is lost, 
even though some of the subprocesses relevant to the fragmentation part at the same time 
result from a gluon in the initial state. 

The relative contribution of the fragmentation component with respect to the direct 
component (where the photon participates in the short-distance, hard-scattering process) is 
larger the larger the center-of-mass energy and the smaller the final-state transverse momen- 
tum at the LHC, for transverse momenta of the order of few tens of GeV, it can become 
dominant. However, here the situation is saved by the so-called 'isolation' cut, which is 
imposed on the photon signal in experiments. Isolation is an experimental necessity: in a 
hadronic environment the study of photons in the final state is complicated by the abundance 
of 7r°'s, eventually decaying into pairs of 7's. The isolation cut simply serves to improve the 
signal-to-noise ratio: if a given neighbourhood of the photon is free of energetic hadron 
tracks, the event is kept; it is rejected otherwise. Fortunately, by requiring the photon to 
be isolated, one also severely reduces the contribution of the fragmentation part to the cross 
section. This is because fragmentation is an essentially coUinear process: therefore, photons 
resulting from parton fragmentation are usually accompanied by hadrons, and are therefore 
bound to be rejected after the imposition of an isolation cut. 

Thus, the fragmentation contribution, that threatened to spoil the cleanliness of the 
photon signals at colliders, is relatively well under control in the case of isolated-photon 
cross sections. There is of course a price to pay for this gain: the isolation condition poses 
additional problems in the theoretical computations, which are not present in the case of 
fully-inclusive photon cross sections. This topic will be the argument of the next subsection. 

1.2 Isolation prescriptions 

Consistently with what written above, we write the cross section for the production of an 
isolated-photon in hadronic collisions as follows: 

daAB{KA, Kb; K^) = 

J dXidx2ft^\xi, Hp) fl;^\x2, ^lF)dcr'^^i^{xiKA, X2Kb; Ky, /i^, /J^f, fJ'-y) 

+ J dxidx2dzflf'\xi, iJ,'p)fl^\x2, iJ,'p)dal^^°l.{xiKA, X2KB] K^/z; /i'^, /i'^, ^i^)D^\z, /i^)(l.l) 

where A and B are the incoming hadrons, with momenta Ka and Kb respectively, and a sum 
over the parton indices a, h and c is understood. In the first term on the RHS of eq. ( |1.1| ) (the 
direct component) the subtracted partonic cross sections da^^^l^ get contributions from all the 
diagrams with a photon leg. On the other hand, the subtracted partonic cross sections (ia^l"' 
appearing in the second term on the RHS of eq. (|1.1|) (the fragmentation component), get 
contribution from the pure QCD diagrams, with one of the partons eventually fragmenting 

^Actually, in the fixed-target pp —> reaction, one can see the fragmentation component increasing 
relatively to the direct one also at very large p^^, because of the direct cross section dying out very quickly 
at such momenta. This effect is of no plienomenological relevance at the LHC. 
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in a photon, in a way described by the parton-to-photon fragmentation function D^'^-'. As the 
notation in eq. (|1 . 1| ) indicates, the isolation condition is embedded into the partonic cross 
sections. 

It is a well-known fact that, in perturbative QCD beyond leading order, and for all the 
isolation prescriptions known at present, with the exception of that of ref. 0, neither the 
direct nor the fragmentation components are separately well defined at any fixed order in 
perturbation theory: only their sum is physically meaningful. In fact, the direct component 
is affected by quark-to-photon collinear divergences, which are subtracted by the bare frag- 
mentation function that appears in the unsubtracted fragmentation component. Of course, 
this subtraction is arbitrary as far as finite terms are concerned. This is formally expressed 



in eq. (|1.1| ) by the presence of the same scale /i^ in both the direct and fragmentation 
components: a finite piece may be either included in the former or in the latter, without 
affecting the physical predictions. The need for introducing a fragmentation contribution is 
physically better motivated from the fact that a QCD hard scattering process may produce, 
again through a fragmentation process, a p meson that has the same quantum numbers as 
the photon and can thus convert into a photon, leading to the same signal. 

As far as the isolation prescriptions are concerned, here we will restrict to those belonging 
to the class that can be denoted as 'cone isolations' In the framework of hadronic 
collisions, where the need for invariance under longitudinal boosts suggests not to define 
physical quantities in terms of angles, the cone is drawn in the pseudorapidity-azimuthal 
angle plane, and corresponds to the set of points 



Cr = iv, 0) I v/(^-^7)' + (0-07)' <R\^ (1-2 



where rj^ and (p^ are the pseudorapidity and azimuthal angle of the photon, respectively, 
and R is the aperture (or half-angle) of the cone. After having drawn the cone, one has to 
actually impose the isolation condition. We consider here two sub-classes of cone isolation, 
whose difference lies mainly in the behaviour of the fragmentation component. Prior to 
that, we need to define the total amount of hadronic transverse energy deposited in a cone 
of half- angle R as 

n 

ET,had{R) = X] ^TiOiR — R^i), (1.3) 
1=1 

where 



R^i = V - ^7)' + - '/'t)^ (1-4) 
and the sum runs over all the hadrons in the event (or, alternatively, i can be interpreted 
as an index running over the towers of a hadronic calorimeter). For both the isolation 
prescriptions we are going to define below, the first step is to draw a cone of fixed half-angle 



Rq around the photon axis, as given in eq. ( |1.2| ). We will denote this cone as the isolation 
cone. 

Definition A. The photon is isolated if the total amount of hadronic transverse energy in 
the isolation cone fulfils the following condition: 

ET,had{Ro) < (^cPtj, (1-5) 
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where ec is a small number, and is the transverse momentum of the photon. 



Definition B. The photon is isolated if the following inequality is satisfied: 



(1.6) 



for all the cones lying inside the isolation cone, that is for R < Rq. The function y is 
arbitrary to a large extent, but must at least have the following property: 



and being different from zero everywhere except for R = 0. 

Definition A was proven to lead to an infrared-safe cross section at all orders of perturbation 
theory in ref. P]. The smaller ec, the tighter the isolation. Loosely speaking, for vanishing 

the direct component behaves like logec, while the fragmentation component behaves 
like ec logec. Thus, for ec ^ eq. ( p!TT|) diverges. This is obvious since the limit ec ^ 
corresponds to a fully-isolated-photon cross section, which cannot be a meaningful quantity, 
whether experimentally (because of limited energy resolution) or theoretically (because there 
is no possibility for soft particles to be emitted into the cone). 

Definition B was proposed and proven to lead to an infrared-safe cross section at all 
orders of perturbation theory in ref. P] . Eq. (|1.7|) implies that the energy of a parton falling 
into the isolation cone Cr^ is correlated to its distance (in the 77-^ plane) from the photon. In 
particular, a parton becoming coUinear to the photon is also becoming soft. When a quark is 
collinear to the photon, there is a collinear divergence; however, if the quark is also soft, this 
divergence is damped by the quark vanishing energy. When a gluon is collinear to the photon, 
then either it is emitted from a quark, which is itself collinear to the photon - in which case, 
what was said previously applies - or the matrix element is finite. Finally, it is clear that the 
isolation condition given above does not destroy the cancellation of soft singularities, since a 
gluon with small enough energy can be emitted anywhere inside the isolation cone. The fact 
that this prescription is free of final-state QED collinear singularities implies that the direct 
part of the cross section is finite. As far as the fragmentation contribution is concerned, in 
QCD the fragmentation mechanism is purely collinear. Therefore, by imposing eq. (|1.6|) , one 
forces the hadronic remnants collinear to the photon to have zero energy. This is equivalent 
to saying that the fragmentation variable z is restricted to the range z = 1. Since the 
parton-to-photon fragmentation functions do not contain any 6{1 — z), this means that the 
fragmentation contribution to the cross section is zero, because an integration over a zero- 
measure set is carried out. Therefore, only the first term on the RHS of eq. ( |1 . 1|) is different 
from zero, and it does not contain any /i^ dependence. 

We stress again that the function 3^ can be rather freely defined. Any sufficiently well- 
behaved function, fulfilling eq. (P!T7|), could do the job, the key point being the correlation 
between the distance of a parton from the photon and the parton energy, which must be 
strong enough to cancel the quark-to-photon collinear singularity. Throughout this paper, 
we will use 



lim y{R) = 0, 





n = 1. 
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We also remark that the traditional cone-isolation prescription, eq. ( |1.5| ), can be recovered 
from eq. ( |1.6|) by setting 3^ = 1 and = ec- 

1.3 Isolated photons at the LHC 

In this section, we will present results for isolated-photon cross sections in pp collisions at 
14 TeV. These results have been obtained with the fully-exclusive NLO code of ref. [^], and 
are relevant to the isolation obtained with definition B; the actual parameters used in the 
computation are given in eq. (|1.8|) , together with = 1. We let Rq = 0.4. We will comment 
in the following on the outcome of definition A. 

Any sensible perturbative computation should address the issue of the perturbative sta- 
bility of its results. A rigorous estimate of the error affecting a cross section at a given order 
can be given if the next order result is also available. If this is not the case, it is customary 
to study the dependence of the physical observables upon the renormalization (/x^) and fac- 
torization (fip) scales. It is important to stress that the resulting spread should not be taken 
as the 'theoretical error' affecting the cross section; to understand this, it is enough to say 
that the range in which and fip are varied is arbitrary. Rather, one should compare the 
spread obtained at the various perturbative orders; only if the scale dependence decreases 
when including higher orders the cross section can be regarded as perturbatively stable and 
sensibly compared to data. 

Usually, /ifl and fip are imposed to have the same value, fi, which is eventually varied. 
However, this procedure might hide some problems, because of a possible cancellation be- 
tween the effects induced by the two scales. It is therefore desirable to vary /i^ and fip 
independently. Here, an additional problem arises at the NLO. The expression of any cross 
section in terms of fi (that is, when fi^ = fip) is not ambiguous, while it is ambiguous if 
fJ'R 7^ fJ'F- In fact, when /i^ ^ fip, the cross section can be written as the sum of a term 
corresponding to the contribution relevant to the case fi^ = fip, plus a term of the kind: 

as(yUA) i3(as(Aifl)) log— , (1.9) 

fip 

where B has the same power of as as the LO contribution, say a^. The argument of the as 
in front of eq. (|1.9D, fi^, can be chosen either equal to fi^ or equal to fip, since the difference 
between these two choices is of NNLO. Thus, it follows that the dependence upon or fip 
of a NLO cross section reflects the arbitrariness of the choice made in eq. ( p..9[ ), which is 
negligible only if the NNLO (as"*"^) corrections are much smaller than the NLO ones (ag"*"^). 
This leads to the conclusion that a study of the dependence upon //^ or fip only can be 
misleading. In other words: B in eq. ( |1.9|) is determined through RG equations in order to 
cancel the scale dependence of the cross section up to terms of order ag"*"^. This happens 



regardless of the choice made for /z^ in eq. ( |1.9| ). However, here we are not discussing 
the cancellation to a given perturbative order of the effects due to scale variations; we 
are concerned about the coefficient in front of the 0{a^'^^) term induced by such variations, 
whose size is dependent upon the choice made for //^ and therefore, to some extent, arbitrary. 
We have to live with this arbitrariness, if we decide to vary /i^ or fip only. However, we 
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MRST99 


CTEQ5 






1 


2 


3 


4 


5 


M 


HJ 




NLO, \ri^\ < 2.5 


23.78 


23.20 


24.19 


22.07 


25.49 


25.10 


24.61 


+U.U68 
-0.057 


LO, \r]^\ < 2.5 


10.34 


10.07 


10.52 


9.875 


10.78 


10.91 


10.66 


+0.090 
-0.072 


NLO, \ri^\ < 1.5 


14.59 


14.23 


14.88 


13.66 


15.53 


15.35 


15.01 


+0.068 
-0.056 


LO, \r]^\ < 1.5 


6.457 


6.270 


6.583 


6.212 


6.657 


6.771 


6.596 


+0.091 
-0.073 



Table 1: Isolated-photon cross sections (nh), with 40 < < 400 GeV, in two dif- 
ferent rapidity ranges, for various parton densities. The scale dependence, evaluated 
according to eq. is also shown. 



can still vary yUj, and fip independently, but eventually putting together the results in some 
sensible way, that reduces the impact of the choice made for /i^. In this section, we will 
consider the quantities defined as follows: 




± 



(T(/ifl = /io, /iF = /io) - Cr(AtH = «±/Lto, /if = /io) 



+ 



cr{fiji = /io, fip = /io) + (^(/ifl 
aiflR = /io, /iF = yUo) - 0-(/ifl 



a-t/iQ, fip — /io) 
/io,/iF = a±fio) 



a(/ifl = /Lto, fJ'F = l^o) + o-{fiR = /io, fJ'F = «±/io) 



;i.io) 



where a+ and a_ = l/a+ are two numbers of order one, which we will take equal to 1/2 and 
2 respectively; the ± sign in front of the RHS of eq. (|1.10| ) is purely conventional. We can 
evaluate {6a/a)± by using /i^ = /i^ or /i^ = fip in eq. (p..9|). The reader can convince himself, 
with the help of the definition of the QCD /3 function, that the difference between these two 
choices is of order in the expansion of the contribution to {6a /a)"^ due to eq. (T~L); on the 
other hand, this difference is only of order in each of the two terms under the square root 
in the RHS of eq. ( |1.10| ). This is exactly what we wanted to achieve: a suitable combination 
of the cross sections resulting from independent /i^ and fip variations is less sensitive to the 
choice for /i^ made in eq. ( |1.9| ) with respect to the results obtained by varying /i^^ or fip only. 

In table |l] we present the results for the total isolated-photon rates, both at NLO and at 
LO. The latter cross sections have been obtained by retaining only the lowest order terms 
((9(aemtts)) in the short- distance cross section, and convoluting them with NLO-evolved 
parton densities. Also, a two-loop expression for as has been used. There is of course 
a lot of freedom in the definition of a Born-level result. However, we believe that with 
this definition one has a better understanding of some issues related to the stability of the 
perturbative series. In order to obtain the rates entering table |1|, we required the photon 
transverse momentum to be in the range 40 < pr-y < 400 GeV, and we considered the rapidity 
cuts |?7^| < 1.5 and \r]^\ < 2.5, in order to simulate a realistic geometrical acceptance of the 
LHC detectors. We first consider the scale dependence of our results (last column), evaluated 
according to eq. ( |1.10| ). We see that the NLO results are clearly more stable than the LO 
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ones; this is reassuring, and implies the possibihty of a sensible comparison between NLO 
predictions and the data. Notice that the size of the radiative corrections {K factor, defined 
as the ratio of the NLO result over the LO result) is quite large. From the table, we see that 
the cross sections obtained with different parton densities differ by 6% at the most (relative 
to the result obtained with MRST99-1 0, which we take as the default set). MRST99 sets 
2 and 3 are meant to give an estimate of the effects due to the current uncertainties affecting 
the gluon density, whereas sets 4 and 5 allow to study the sensitivity of our predictions 
to the value of a^iM^) (sets 1, 4 and 5 have Af^ =220, 164 and 288 MeV respectively). 
On the other hand, the difference between MRST99-1 and CTEQ5M [|] results is due to 
the inherent difference between these two density sets (CTEQ5M has N^^ =226 MeV, and 
therefore the difference in the values of as{Mz) plays only a very minor role). 

From inspection of table |1], we can conclude that isolated-photon cross section at the 
LHC is under control, both in the sense of perturbation theory and of the dependence upon 
non- calculable inputs, like a^^Mz) and parton densities. The relatively weak dependence 
upon the parton densities, however, is not a good piece of news if one aims at using photon 
data to directly access the gluon density. On the other hand, the expected statistics is large 
enough to justify attempts of a direct measurement of such a quantity. In the remainder of 
this section, we will concentrate on this issue. We will consider 



where x is any observable constructed with the kinematical variables of the photon and, 
possibly, of the accompanying jets, a and o"o are the cross sections obtained with two 
different sets of parton densities, the latter of which is always the default one (MRST99-1). 
We can imagine a gedanken experiment, where it is possible to change at will the parton 
densities; in this way, we can assume the relative statistical errors affecting a and ctq to 
decrease as l/\fN and l/v^iVo, N and Nq being the corresponding number of events. It is 
then straightforward to calculate the statistical error affecting IZx] by imposing IZx to be 
larger than its statistical error, one gets 



where C is the integrated luminosity, e < 1 collects all the experimental efficiencies, and 



7^x 



doQ jdx — da jdx 



(1.11) 



(io"o /dx + da/dx'' 




1 



(1.12) 




(1.13) 
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Figure 1: Dependence of isolated-photon and isolated-photon-plus-jet cross section 
upon parton densities, as a function of 

P'T'y Oj'^d Xryj . 



In this equation V S* is the center-of-mass energy of the colhding hadrons, and prj and r/j 
are the transverse momentum and rapidity of the hardest jet recoihng against the photon. 
In order to reconstruct the jets, we adopted here a fc-r-algorithm, namely that proposed 
in ref. []^, with D = 1. Notice that x^j exactly coincides at the leading order with the 
Bjorken-x of the partons in one of the incoming hadrons; NLO corrections introduce only 
minor deviations. For all the density sets considered, the dependence of TZ upon pT-y is rather 
mild. The values in the 1ow-Pt7 region could also be inferred from table |l|, since the cross 
section is dominated by small p^^'s. Analogously to what happens in the case of total rates, 
the sets MRST99-4 and MRST99-5 give rise to extreme results for Tlpj,^, since the value of 
^QCD is quite different from that of the default set. From the figure, it is apparent that, by 
studying the transverse momentum spectrum, it will not be easy to distinguish among the 
possible shapes of the gluon density. On the other hand, it seems that, as far as the statistics 
is concerned, a distinction between any two sets can be performed. Indeed, the symbols in 
the figure display the quantity defined in eq. ( [1.12|) , for C = 100 fb~^, Ap^-y = 10 GeV and 



e = 1. Of course, the latter value is not realistic. However, a smaller value (leading to a 
larger (TVjmin), can easily be compensated by enlarging Aprpj and by the fact that the total 
integrated luminosity is expected to be much larger than that adopted in fig. ^ 

Turning to the right panel of fig. |l], we can see a much more interesting situation. Actu- 
ally, it can be shown that the pattern displayed in the figure is rather faithfully reproduced 
by plotting the analogous quantity, where one uses the gluon densities instead of the cross 
sections. This does not come as a surprise. First, x^j is in an almost one-to-one corre- 
spondence with the X entering the densities. Secondly, photon production is dominated by 
the gluon-quark channel, and therefore the cross section has a linear dependence upon g{x), 
which can be easily spotted. It does seem, therefore, to be rather advantageous to look at 
more exclusive variables, like photon-jet correlations (this is especially true if one considers 
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the procedure of unfolding the gluon density from the data: in the case of single-inclusive 
variables, the unfolding requires a de-convolution, which is not needed in the case of cor- 
relations). Of course, there is a price to pay: the efficiency e will be smaller in the case 
of photon-jet correlations, with respect to the case of single-inclusive photon observables, 
mainly because of the jet-tagging. However, from the figure it appears that there should be 
no problem with statistics, except in the very large x^j region. 

Finally, we would like to comment on the fact that, for the case of single-inclusive photon 
observables, we also computed the cross section by isolating the photon according to defini- 
tion A, using ec = 2 GeV/pTy. The two definitions return a pr-y spectrum almost identical 
in shape, with definition B higher by a factor of about 9%. It is only at the smallest pr^y 
values that we considered, that definition B returns a slightly steeper spectrum. The fact 
that such different definitions produce very similar cross sections may be surprising. This 
happens because, prior to applying the isolation condition, partons tend to be radiated close 
to the photon; therefore, most of them are rejected when applying the isolation, no matter 
of which type. This situation has already been encountered in the production of photons at 
much smaller energies. The reader can find a detailed discussion on this point in ref. 
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Direct photon pair production at colliders, 
an irreducible background to Higgs boson searches at 

the LHC 

T. BiNOTH, J. P. GuiLLET, V.A. Ilyin, E. Pilon and M. Werlen 



Abstract 

Direct Q photon pairs with large invariant mass are the so-cahed irreducible back- 
ground in the search for Higgs bosons at the LHC in the channel h — > 77, in the mass 
range 80 — 140 GeV/c^. This huge background requires an understanding and quan- 
titative evaluation. Photon pair production at the Tevatron offers the opportunity to 
already test our understanding of this process. In the same mass range, the produc- 
tion of a Higgs boson {hrightarrow^^) in association with a hard jet at the LHC is 
a promising channel, as the corresponding 77+ jet background may be under better 
control. 

1 Role and relevance of higher order corrections 

Our theoretical understanding of direct photon pair production (as any hard hadronic pro- 
cess, cf. [^) is based on the QCD improved parton model, according to which long and short 
distance effects factorize from each other. Short distance subprocesses are safely computed 
in perturbative QCD. Long distance effects cannot be completely calculated from QCD at 
present, although their scaling violations can be. Instead, they are extracted from experi- 
mental data and encoded into non-perturbative quantities, such as the parton distribution 
functions in incoming hadrons and, if necessary, inclusive fragmentation functions of partons 
into observed outgoing particles, e.g. photons. Yet these quantities are universal, i.e. inde- 
pendent of the hard subprocess: schematically they can be measured in one process, then 
transported to predict another one. 

However, the border between short and long distance scales is arbitrary. The separation 
requires the introduction of unphysical parameters; e.g. the factorization scale M^, and 
similarly the fragmentation scale M| in the case of a fragmentation process. In an ideal 
exact calculation, the dependence on these spurious parameters (as well as on the arbitrary 
renormalization scale /x^) would cancel between the short and long distance parts. In an 
actual expansion in powers of truncated at some finite order, this cancellation is only 

^ "Direct" or "prompt" mean that these photons do not result from the decay of 7r° and ry. 
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partial; it holds up to a term of the lowest uncalculated order. As the order of truncation 
increases, theoretical estimates become flatter and flatter over broader and broader ranges 
of these spurious scales. The uncertainty induced by this actual dependence restricts the 
accuracy and predictive character of QCD calculations. In particular, the result of a lowest 
order calculation is plagued by a large monotonic dependence with respect to M^, Mj and 
/i^. It changes by a large factor (two or more) when these scales are varied around the typical 
hard scale of the process; such a lowest order estimate is not at all quantitative. This is the 
first reason why any tentatively accurate QCD calculation has to be carried out to at least 
next-to-leading order (abbreviated below as NLO). 

Another important motivation is that higher order corrections to some given process 
may reveal new mechanisms, whose rates may be not necessarily negligible compared to 
the leading order contribution. The production of photons is a typical example of this 
phenomenon, as will be explained below. This amounts to large higher order corrections, 
which affect substantially both the magnitude and the shape of the distributions, not due to 
poor apparent convergence of the first terms in the perturbative expansion, but for physically 
understood reasons. 

Finally, finite order calculations may not be accurate enough, as in the case of infrared 
sensitive observables, i.e. observables controlled by multiple soft gluon emission ^ Yet, in 
some less well-known cases of infrared sensitivity, they may reveal perturbative instabilities or 
even divergences plaguing the calculation at any further order inside the physical spectrum 
This is, for example, the case for the transverse momentum distribution of pairs of 
isolated photons, as will be discussed below. The calculation of higher order corrections is 
therefore the first step towards a deeper understanding of what happens in such cases. 

2 Mechanisms of production. 

Schematically, three possible mechanisms may produce prompt photon pairs with large in- 
variant mass: one (which may be called "two direct") in which both photons take part 
directly in the hard subprocess, another one ("one fragmentation") in which one of the 
photons undergoes the hard subprocess while the other results from the fragmentation of a 
hard parton (quark or gluon), itself produced at large tranverse momentum, and yet another 
mechanism ( "two fragmentation" ) in which both photons result from such a fragmentation. 
This schematical splitting into these three contributions emerges from a factorization pro- 
cedure sketched in what follows. Although this splitting provides a convenient picture, one 
must however keep in mind that it is arbitrary; none of these contributions can be measured 
separately. Only their sum is physical. 

2.1 Direct vs. fragmentation mechanisms 

From a topological point of view, a photon produced from fragmentation is with a high prob- 
ability accompanied by a jet of hadrons. From a technical point of view, the lowest order of 

2 See for example |], |, | 
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the "one fragmentation" contribution emerges in the calculation of higher order perturbative 
corrections to the "two direct" contribution given by the Born process qq 77. Some of 
these higher order corrections, such as qg — ^ 977, are plagued by final state coUinear sin- 
gularities associated with the coUinear splitting q — >■ qj. The latter have to be factorized 
and absorbed into quark and gluon fragmentation functions to a photon, D^/g or g{z, Mj) 
defined at some fragmentation scale Mj. Analogously to the so-called anomalous compo- 
nent of the photon structure function, a coUinear logarithmic enhancement occurs, induced 
by the pointlike quark-photon coupling. To all orders in a^, this phenomenon results in 
D-y/q or g{z, Mj) bchavlug asymptotlcally as a/as{Mj). This compensates the one extra 
power of as involved in the short distance subprocess, so that fragmentation contributions 
are asymptotically of the same order as the Born term, by power counting in a^. What is 
more, given the high gluon density at LHC, the gq (or q) initiated process involving one pho- 
ton from fragmentation even dominates the inclusive production rate in the range 80 — 140 
GeV. 

In turn, higher order corrections to "one fragmentation" reveal the "two fragmentation" 
mechanism. Similarly, a coUinear enhancement associated with each photon fragmentation 
compensates two extra powers of in the short distance subprocess, so that the power 
counting in is here also asymptotically the same as for the Born and "one fragmentation" 
parts. Higher order corrections to both fragmentation contributions have in principle to be 
computed in order to provide a consistent NLO study. This has been done in p. The actual 
quantitative significance of these contributions is discussed in sect. 0. 

2.2 The box contribution 

Beyond this, the gluon-gluon fusion contribution gg — > 77, of the "two direct" type, cannot 
be neglected. Indeed, although it is an 0{a'^al) i.e. next-to-next-to-leading order contribu- 
tion, the suppression due to higher powers of as is compensated by the large gluon luminosity 
at colliders. This is especially true at LHC in the relevant range for Higgs search, where the 
so-called box contribution has the same magnitude as the Born term, roughly 50 to 80 %. 

Moreover it is the lowest order of a new mechanism, whose spurious scale dependences are 
thus monotonic, and only higher corrections to it would reduce the sensitivity with respect 
to spurious scale dependences. Finally, this lowest order is a 2 — > 2 process which yields only 
back to back photons in the direction transverse to the beam axis; it gives no contribution 
to the tail of the transverse momentum distribution of photon pairs. An evaluation of the 
distortion of the transverse momentum distribution of photon pairs due to the process of 
gluon-gluon fusion requires the computation of at least the next order correction 0. The 
sum "two direct -|- box" will be refered to as the "direct" contribution. 



•^and in some given factorization scheme. Here we use the MS scheme, |Q. 

''i.e. when the fragmentation scale M'j (chosen of the order of the hard scale of the subprocess) is large 



compared to any typical hadronic scale ~ 1 GeV^ . 
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3 Isolation 



At TeV colliders, the fragmentation contributions are far from negligible. In particular, the 
"one fragmentation" component dominates the inclusive production of photon pairs in the 
lower range of the invariant mass spectrum; in the range of interest for Higgs search at LHC 
it happens to be 2 to 5 times larger than the "two direct" contribution, depending on choice 
of scales 0. A NLO evaluation of the fragmentation contribution is thus necessary to 
have a tentatively reliable prediction. 

Actually, collider experiments do not measure inclusive photon pairs. Isolation cuts Q are 
imposed experimentally to drastically reduce the gigantic background coming from decays of 
TT^ and rj mesons. Schematically, a candidate-photon is considered isolated if, in some given 
cone in azimuthal angle and rapidity about the photon defined by 



+ {y-y,Y<R\ (3.1) 

the deposited hadronic transverse energy El^'^'^ is smaller than some maximal amount EJp"-^, 

E!^""^ < E^"^ (3.2) 

R and being fixed by the experiments. Such isolation cuts affect also the production 

rate of direct photon pairs, especially the "single-" and "two fragmentation" contributions, 
whose topologies are similar to the one of the background. Those are severely reduced when 
£]max jg chosen to be very small compared to the transverse momenta of the photons. 

Yet a NLO evaluation of fragmentation contributions is still relevant for various reasons. 
First, the actual isolation cuts used by collider experiments may be quite more complicated 
than the schematical criterion given by eqns. ( p.l| , |3.2| ). Higher order partonic calculations 



are not designed to account for such criteria accurately, contrary to Monte-Carlo event 
generators such as PYTHIA Q or HERWIG |^. Since these Monte-Carlos and NLO partonic 
calculations are based on different QCD approximations, it is worthwhile to compare these 
two approaches whenever possible, as for the inclusive production rate, as well as with 
rather simple isolation cuts such as the one of eqns. (|0| , p.2|) . Secondly, the above cone- type 
isolation criterion induces infrared sensitivity inside the physical spectrum for observables 
such as the qt spectrum of photon pairs. This effect appears at the NLO - and every higher 
order - in the "one fragmentation" component, as will be shown in the next section. 



4 Phenomenology 

In earlier works on di-photon production |jlO|, only the "two direct" contribution was cal 



culated at NLO, while the fragmentation contribution included only the lowest order "one 
fragmentation" part f\ Moreover, these works were not implemented in a form suited to com- 
pute observables such as the invariant mass relevant for Higgs search, nor flexible enough to 

^As already mentioned in the beginning of sect. this statement is strongly fragmentation scale 
dependent. 

^More about isolation issues in processes involving direct photons is discussed in 0. 
^The "box" contribution was included too. 
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Figure 1: Diphoton differential cross section dajdm^^ vs. m^^, the mass of the photon pair, at 
the Tevatron, = 1.8 TeV. Preliminary data points (statistical errors and systematics in quadrature) 



from the DO collaboration [|12| are compared to the theoretical predictions; the full NLO prediction is 
shown as the solid line while open squares (open circles) represent the single (double) fragmentation 
contribution. 
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Figure 2: Diphoton differential cross section da/dpr vs pj-, the transverse energy of each photon, 
at Tevatron, = 1.8 TeV. Preliminary data points (statistical errors and systematics in quadrature) 



from the DO collaboration [12| are compared to the theoretical predictions; the full NLO prediction is 
shown as the solid line while open squares (open circles) represent the single (double) fragmentation 
contribution. The ratio data/(full NLO theory) is shown below. 
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Figure 3: Diphoton differential cross section da/dqT vs. qx, the transverse momentum of the photon 
pair, at the Tevatron, ^fS = 1.8 TeV. Preliminary data points (statistical errors and systematics in 
quadrature) from the DO collaboration |12| are compared to the theoretical predictions; the full NLO 
prediction is shown as the solid line. 
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Figure 4: Diphoton differential cross section dajd(\)~f~f vs. (/>^-y, the azimuthal angle between the two 
photons, at the Tevatron, = 1.8 TeV. Preliminary data points (statistical errors and systematics in 
quadrature) from the DO collaboration |12| are compared to the theoretical predictions; the full NLO 
prediction is shown as the solid line. The direct contribution is shown as the dashed line while open 
squares (open circles) represent the single (double) fragmentation contribution. 
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accomodate experimental selection cuts. A further refinement |TT[ has implemented the same 
approximation in a more flexible approach combining analytical and Monte-Carlo integration 
techniques, thus allowing the computation of several observables within the same calcula- 
tion, and the possibility to account for selection/isolation cuts. Two recent developments 
are presented in this workshop (see also C. Balazs' contribution). 

We have implemented the "two direct", "single-" and "double fragmentation" contri- 
butions at NLO accuracy, together with the box gluon-gluon contribution, into a general 
purpose computer program of "partonic event generator" type (DIPHOX) presented in de- 
tail in p. The results which we present here are derived from this analysis. 



4.1 Comparison with Tevatron data 

The NLO results agree with the preliminary DO data ]12|] reasonably, as seen in Figs. (|l|- 
except for the tails of each photon's transverse momentum distribution da/dEx and 
the invariant mass distribution of pairs da/dM^-y at large Et and M-y-y respectively, where 
the three highest data points are affected by correlated systematic uncertainties due to 
background evaluation in both cases. However, more instructive conclusions will be drawn 
after a finalized understanding of the systematics, and even more so after the Tevatron Run 
II with the statistics improved by a factor of 20. 



4.2 Estimates for LHC 

We now give some theoretical estimates in the domain relevant for Higgs search at the LHC, 
for the invariant mass distribution cf. Fig. @ with and without isolation. One has to keep 
in mind that the theoretical uncertainties are still large. Firstly these results are still plagued 
by rather large scale uncertainties, as discussed below. Secondly, for a given scale choice, 
they may still underestimate the actual background to Higgs search. 



4.3 Critical examination of various theoretical issues 
4.3.1 Scale uncertainties 

As mentioned above all results depend on three unphysical scales. Varying these between 
and 4M^^ along the first diagonal /i^ = = M|, the NLO results for the invariant 
mass distribution appear surprisingly stable, since they change by about 5% only. Alterna- 
tively, anti-diagonal variations of fi"^ and = Mj in the same interval about the central 
value M^^ lead to a variation still rather large (up to 20 %). This is because variations with 
respect to /x^ and act in opposite ways. When /x^ is increased, as(/i^) and hence the 
NLO corrections decrease; on the other hand the relevant values of the momentum fraction 
of incoming partons are small, ~ O{10~^ to 10~^), so that the gluon and sea quark distribu- 
tion functions increase when is increased. Scale changes with respect to fi"^ and thus 
nearly cancel again each other along the first diagonal but add up in the other case. Actually, 
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the stabihty along the first diagonal is accidental at this order These observations hold 
separately for the box contribution. 

In conclusion, the /i^, dependences are thus not completely under control yet. On the 
other hand, accounting for the NLO corrections to the fragmentation components provides 
stability with respect to Mj variations about orthodox choices of the fragmentation scale. 

4.3.2 Quantitative importance of fragmentation contributions 

For orthodox choices of the fragmentation scale, Mj of order M^^, the "single fragmentation" 
contribution is small at Tevatron, given the stringent isolation cuts used, and the "two 
fragmentation" one is even smaller. For example, as can be seen in Figs. (|l],0) the "one 
fragmentation" contribution is about one order of magnitude less than the "two direct" one. 
It may still have a small visible effect, as in the tail of the azimutal angle distribution da/d(j)^^ 
(0^^ being the azimutal angle between the two photons of a pair) in the low range, cf. 
Fig. (§. 

The situation is the same for LHC predictions, see Fig. (|^). Contributions from frag- 
mentation are drastically reduced when very stringent cuts are imposed, e.g. E^"'^ = 2.5 
GeV in i? = 0.4. However, in practice such isolation cuts will be nearly saturated by under- 
lying events: their veto on the hard event itself is thus even more severe, allowing almost no 
transverse energy leakage from the hard process inside the cone. This may be experimentally 
most suitable. However, requiring that no transverse energy be deposited in a cone of fixed 
size about a photon is not infrared safe order by order in perturbation. It means that finite 
but very stringent isolation cuts imposed in fixed order partonic calculations would lead 
to unreliable results. When less severe isolation cuts are used, the "one-", and to a lesser 
extend, "two fragmentation" components are subdominant but not negligible. 

4.3.3 Infrared sensitive distributions 

Being based on a finite order calculation, our computer code is not suited for the study of 
observables controled by multiple soft gluon emission [jl], |^. Among those, one may distin- 
guish the following examples, most of which would require an improved account of soft gluon 
effects. 

The transverse momentum distribution of pairs da/dq^ near qx = 

The problematics of the "two direct" contribution is similar to the well-known Drell-Yan 

process, see ^ . On the other hand, the fragmentation contributions do not diverge order by 

^In processes for which the lowest order involves some power of a^, an explicit dependence appears in 
next-to-leading order correction, which partially compensates the fjl^ dependence in as 

(^2). Unlike this, m 

the two direct component which dominates the cross section when a drastic isolation is required, the lowest 
order involves no as- The explicit dependence would thus appear only at 0(0^.), i.e. at next-to-next-to 
leading order. At next to leading order, the /i^ dependence occurs only through the monotonous decrease of 
the asifi^) weighting the first correction: there is no partial cancellation of fj? dependence. The mechanism 
is more complicated in the fragmentation components, and the situation becomes mixed up between all 
components when the severity of isolation is reduced. 
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Figure 5: Diphoton differential cross section da/dm^r^ vs. m^^ at the LHC, = 14 TeV, without 
and with isolation criterion Exmax = 5 GeV in ii = 0.4. Same kinematic cuts as in Fig. @. The scale 
choice is M = Mf = /x = 
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Figure 6: Splitting of the diphoton differential cross section da/dniyy at the LHC, s/S = 14 TeV 
with isolation criterion Exmax = 5 GeV in i? = 0.4, into the "direct" , "one fragmentation" and "two 
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kinematic cuts are applied: prili) > 40 GeV, pt(72) > 25 GeV, |j/(7i,2)| < 2.5. 
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order when qx — > 0. Indeed, in the "one fragmentation" case, 

parton 1 + parton 2 — > 71 + parton 3 (4-1) 
parton 3 I2 + X (4.2) 

the NLO contribution to the hard subprocess ( [4. ID yields a double logarithm 

~ a<jln^ l|Pr(7i) + PT(parton3)|| (4.3) 

when Pt'(7i) + pj.(parton3) 0. However the extra convolution associated with the frag- 
mentation (|4.2|) involves an integral over Z2 = pT(72)/PT(parton3) which smears out this 
integrable singularity. The "two fragmentation" contribution involves two such convolu- 
tions, and hence one more smearing. 

The azimuthal angle distribution da/dip^^ near = tt 

This case differs from the previous one for two reasons. Firstly, not only the "two direct" 
contribution diverges order by order when (p^^ tt, but also both "single-" and "double- 
fragmentation" contributions do, as can be seen in Fig. (§). Moreover, in both fragmentation 
cases, soft gluons may couple to both initial- and final-state hard emitters. Indeed, consider 
the example of the "one fragmentation case", cf. eqns. (|4.1| ). Selecting — >■ vr emphasizes 
0(parton3) — 0(7i) — vr, so that all the emitted partons besides parton 3 have to be coUinear 
to either of the incoming or outgoing particles, and/or soft, which yields double logarithms 

~ a, In^ [tt - (0(parton 3) - 0(7i))] (4.4) 

associated with each of the hard partons 1,2,3 - plus single logarithms as well. For the 
observable da/dcf)^^ near = vr, the integral involved in the convolution of the hard sub- 
process with the fragmentation functions does not smear these logarithmic divergences, since 
the fragmentation variable Z2 = pT(72)/PT(parton3) is decoupled from the azimutal variable 
(/>(parton3) equal to 0(72) in the soft and coUinear limits. A similar explanation holds for 
the "double fragmentation component". An analagous problem affects the qt distribution 
of a pair photon + jet at low qx- 

The azimuthal angle distribution da/dcj)^^ near = 

Both fragmentation contributions to da/dip^^ diverge also order by order when — 0. 
Here also soft gluons may couple to both initial- and final-state hard emitters. Actually, 
given the large invariant mass of the pairs, the vicinity of 0^^ = is never probed, so that 
an improved treatment of soft gluon effects is not needed in this case. Yet, as a consequence, 
an increase of the "single-fragmentation" contribution can be seen | in the lower range of 
the 0-y-y spectrum, cf. Fig. (|). 

An infrared divergence for da/dqx inside the physical spectrum 

Besides the well-known issue at qx = 0, another infrared sensitive point appears in the 

similar behaviour occurs also for the "two fragmentation" contribution, which is however too tiny to 
have any significant effect. 



173 



spectrum due to isolation, at the critical value = EJp°'^ . Indeed, at lowest order the "one 



fragmentation" component is not smooth, it instead behaves as a step function [|13 

"single fragm'\{LO) 



da 
dqr 



oc e [EJp"^ - qr) (4.5) 



Consequently, in agreement with the general study of the NLO - and every higher order 
- correction has a double logarithmic divergence at the critical point. Such singularities are 
very sensitive to the kinematical constraints and the observable considered. The present 
case has a double logarithm below the critical point. 



da 




"single fragrrC\{N LO) 

OC -a, In^ (^r" - qr) & {E^" - Qt) + ■ ■ ■ (4.6) 

The infrared sensitive behaviour at this critical point can be infered on Fig. (^, where 
a rather large value for EJp""^ is used in order to split this critical point from the small 
region. 

This effect is not visible on the theoretical prediction of Fig. (^) because of the low value 
£]max _ ^ GeV used by the DO collaboration. The two infrared sensitive regions (g^ 
and qr ~ EJp"'^) are not separated enough, and the bin smearing averages over both effects. 
The phenomenon is thus camouflaged. An all order summation of this soft gluon effect has 
to be carried out to restore a sensible shape to the distribution. A similar observation 
would be made about the qx distribution of a pair {photon + jet}. 



5 Perspectives 

Future improvements of the theoretical understanding of di-photon production will require 
various inputs. Next-to-next to leading order corrections would hopefully stabilize the scale 
dependences and correct the normalisations. Another important quantitative improvement 
would be the higher order corrections to the box contribution, which is within reach thanks 
to some recent achievements. Progresses in this direction are reported on elsewhere in these 



proceedings [14 



This accounting of multiple soft gluon effects, already implemented in |^ for the "two 
direct" contribution in absence of isolation cuts, is needed in the DIPHOX-hased study to 
provide correct distributions in the infrared sensitive regions. This affects the qT distribution 
at low qr- It also concerns the (p^^ distribution when — > vr, which has been less studied, 
and for which the soft gluon effects in the fragmentation cases are more involved than in the 
"two direct" case [^, r^|. It is also important for the qx distribution in the vicinity of the 



critical point = EJp"-^, induced by the fixed cone type isolation criterion. 

The understanding of the background to the Higgs search is quantitatively not yet on 
the same footing as for the signal. Hence, accounting for the higher order correction to the 
signal in numerical simulations might be instructive, but the results should be considered 
with care in order to avoid statements that are too optimistic. 
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Figure 7: Diphoton differential cross section da/dqT at LHC, s/S = 14 TeV, with isolation criterion 

Ermax = 15 GeV in = 0.4. 
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6 Higgs search in association with a hard jet 



In order to overcome the present insufficient control of higher order corrections in inclusive 
production, it has been suggested [|T7| to study the associated production of 77)+ jet, for 
which both signal S and background B are lower (but still at the level of hundred signal events 
at low luminosity). The lowest order estimate has shown that the S/B ratio is improved 
critically (up to 1/2 — 1/3) with the same level of the significance S/^/B. Furthermore, 
higher order corrections to the background have been shown recently [l^ to be under better 
control than in the inclusive (i.e. unassociated) case. 



6.1 Background: associated vs. inclusive 

Indeed, in the inclusive case, the magnitude of the NNLO box contribution is comparable 
to the LO cross section essentially because the latter is initiated by qq, whereas the former 
involves gg. The gg luminosity, much larger than the qq one, compensates numerically the 
extra factor of the box. On the contrary, in the channel 77 + jet, the LO cross-section 
is dominated by a qg initiated subprocess. The qg luminosity is sizeably larger than the qq 



one, so that the corresponding NNLO remains small compared to the LO result |T^. Thus, 
expecting that the subprocess gg '-ffg gives the main NNLO correction, a quantitative 
description of the background with an accuracy better than 20% could be achieved already 
at NLO in the 77+ jet channel for a high pj- (> 30 GeV) jet. All the helicity amplitudes 
needed for the implementation of the ("direct" contribution to the) background to NLO 
accuracy Qare now available [0, |2l| . 



6.2 Signal vs. background 

The origin of this improvement of the S/B ratio is the following. At LO, the signal of 
associated 77) +jet production has basically a 2-body kinematics, due to the extremely 
small Higgs width (a few MeV). On the contrary, the LO background contributions qq —>■ jjg 
and gq ^ 'y + 'y + q (as well as the NNLO gg —>■ '-ffg one) have 3-body kinematics. This is in 
contrast to the inclusive channel where both signal and background LO subprocesses have 
2-body kinematics. This circumstance opens the room for more refined cuts to suppress the 
background more efficiently |]T^ . The contributions qq —>■ 77(7 and gg — >■ 775^ are suppressed 
down to 40% of the signal (to be compared with S/B ~ 1/7 for the inclusive channel). 
Unfortunately, as found in |jT^, the contribution of the other subprocess gq —>■ 775, which 
dominates, yields the overall ratio S'/i?~l/2 — 1/3. 

The cuts which allow this efficient suppression of the irreducible background at LO are 
based on the differences in the shapes of the angular distributions in the partonic c.m.s.. 
Due to helicity and total angular momentum conservation, the S-wave does not contribute 

lOWe remind the reader unfamiliar with the LO, NLO, NNLO, etc, terminology that this terminology does 
not refer to the absolute power of as involved, but instead to the relative power with respect to the Born 
term of the process considered. Hence, NLO corrections to 77+ jet are also part of the NNLO corrections 
to 77 inclusive (converse not true, cf. box). 
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to the dominant signal subprocess gg — > Hg. On the contrary, all angular momentum 
states contribute to the subprocesses gq •yyq and qq lig- Therefore, the signal has a 
more suppressed threshold behaviour compared to the background. The S/ B ratio can thus 
be improved by increasing the partonic c.m.s. energy far beyond threshold, and a cut 
\/I > 300 GeV has been found to give the best S/B ratio for the LHC. Actually, the effect can 
not be fully explained by the threshold behavior only, since that would result in a uniform 
suppression factor. It was shown in [0, ^ (see Figs. 5 and 6 there) that the dependences of 
the background and the signal on the c.m.s. angular variables are quite different; therefore, 
the strong s cut affects them with different suppression factors (see [0, 0] for more details). 
This effect can be exploited to enhance the significance S/\fB at the same level as S/B. If 
the cut cos('i9*)(j7) < —0.87 on the jet-photon in the partonic c.m.s. is applied for \f§ < 300 
GeV and combined with the cut > 300 GeV, the change on S/B is rather small, while the 
significance is improved by a factor ~ 1.3. The same effect can be observed with the cut on 
the jet angle in the partonic c.m.s. (^9*(j), cf. Fig. 5 of [T^, but one should notice that 
the two variables, '&*{j'y) and are correlated. Therefore, it is desirable to perform a 

multi-variable optimization of the event selection. Notice that the present discussion is based 
on a LO analysis, and concerns only what was defined above as the "direct" component of 
the irreducible background. One now has to understand how this works at NLO. 

Other, reducible, sources of background are potentially dangerous. The above-defined 
"one fragmentation" component to the so-called irreducible background, and the reducible 
background coming from misidentification of jet events were treated on a similar footing 



in the LO analysis of [|T^ as a de facto reducible background. In |T^, a rough 
analysis found that this reducible background is less than 20% of the irreducible one after 
cuts are imposed. The misidentification rate is given mainly by the subprocesses gq "ygq, 
99 ~^ ll'i and qq' 'jq{g)q'{g), when the final state parton produces an energetic isolated 
photon but other products of the hadronization escape the detection as a jet. There, a 
l{'^^)/j^t rejection factor equal to 2500 for a jet misidentified as a photon and 5000 for a 
well separated 7(vr°) production by a jet were used. No additional vr" rejection algorithms 
were applied. Furthermore, this reducible background is expected to be suppressed even 
more strongly than the irreducible background of "direct" type when a cut on \/S is applied. 

In summary, the associated channel i?(— > 77)+ jet with jet transverse energy Et > 30 
GeV and rapidity \ri\ < 4.5 (thus involving forward hadronic calorimeters) opens a promising 
possibility for discovering the Higgs boson with a mass of 100-140 GeV at LHC even at low 
luminosity. However, to perform a quantitative analysis, the NLO calculations, namely of 
the background, have to be completed, and included in a more realistic final state analysis. 
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